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CROSS-REFERENCE TO RELATED APPLICATIONS 

5 This application claims priority to U.S. provisional application Serial No. 

60/1 52,493, filed August 30, 1999 and herein incorporated by reference in its entirety for 
all purposes. 

BACKGROUND 

10 Technical Field 

The present invention is directed to plant acyltransferase-like nucleic acid and 
amino acid sequences and constructs, and methods related to their use in altering sterol 
composition and/or content, and oil composition and/or content in host cells and plants. 

15 Related Art 

Through the development of plant genetic engineering techniques, it is now possible 
to produce transgenic varieties of plant species to provide plants which have novel and 
desirable characteristics. For example, it is now possible to genetically engineer plants for 
tolerance to environmental stresses, such as resistance to pathogens and tolerance to 
2 0 herbicides. It is also possible to improve the nutritional characteristics of the plant, for 
example to provide improved fatty acid, carotenoid, sterol and tocopherol compositions. 
However, the number of useful nucleotide sequences for the engineering of such 
characteristics is thus far limited. 

There is a need for improved means to obtain or manipulate compositions of sterols 

2 5 from biosynthetic or natural plant sources. The ability to increase sterol production or alter 

the sterol compositions in plants may provide for novel sources of sterols for use in human 
and animal nutrition. 

Sterol biosynthesis branches from the famesyl diphosphate intermediate in the 
isoprenoid pathway. Sterol biosynthesis occurs via a mevalonate dependent pathway in 

3 0 mammals and higher plants (Goodwin,( 1 98 1 ) Biosynthesis of Isoprenoid Compounds, vol 

1 (Porta:, J.W. & Spurgeon, S.L., eds) pp.443-480, John Wiley and Sons* New York>, 
while in green algae sterol biosynthesis is thought to occur via a mevalonate independent 
pathway (Schwender, et aL (1997) Physiology, Biochemistry, and Molecular Biology of 
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Plant Lipids, (Williams, J.P., Khan, M.U., and Lem, N.W., eds) pp. 180-182, Kluwer 
Academic Publishers, Norwell, MA). 

The solubility characteristics of sterol esters suggests that this is the storage form of 
sterols (Chang, et al y (1997) Annu. Rev. Biochem., 66:613-638). Sterol O-acyltransferase 
5 enzymes such as acyl CoAxholesterol acyltransferase (ACAT) and lecithinxholesterol 
acyltransferase (LCAT) catalyze the formation of cholesterol esters, and thus are key to 
controlling the intracellular cholesterol storage. In yeast, it has been reported that 
overexpression of LROl, a homolog of human LCAT, and phospholipid:diacylglycerol 
acyltransferase increased lipid synthesis (Oelkers et al., (2000) J. Biol Chern., 26:15609- 
10 15612; Dahlqvist et aL, (2000) Proa Natl. Acad Sci. USA, 97:6487-6492). 

The characterization of various acyltransferase proteins is useful for the further 
study of plant sterol synthesis systems and for the development of novel and/or alternative 
sterol sources. Studies of plant mechanisms may provide means to further enhance, 
control, modify, or otherwise alter the sterol composition of plant cells. Furthermore, such 
1 5 alterations in sterol content and/or composition may provide a means for obtaining 

tolerance to stress and insect damage. Of particular interest are the nucleic acid sequences 
of genes encoding proteins which may be useful for applications in genetic engineering. 

SUMMARY OF THE INVENTION 

The present invention is directed to lecithinxholesterol acyltransferase-like 
polypeptides (also referred to herein as LCAT) and acyl CoAxholesterol acyltransferase- 
like polypeptides (also referred to herein as ACAT). In particular the invention is related 
to polynucleotides encoding such sterokacyltransferases, polypeptides encoded by such 
polynucleotides, and the use of such polynucleotides to alter sterol composition and oil 
production. The polynucleotides of the present invention include those derived from plant 
sources. 

One aspect of the invention, therefore, is an isolated nucleic acid sequence 
encoding a plant lecithinxholesterol acyltransferase-like polypeptide, a fragment of a plant 
lecithinxholesterol acyltransferase-like polypeptide, a plant acyl CoAxholesterol 
acyltransferase-like polypeptide or a fragment of a plant acyl CoAxholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
SEQ ID NO: 2, 4, 6, 8, 10-29, 43-51, 73 or 75. Also provided is an isolated nucleic acid 
sequence consisting of SEQ ID NO: 2, 4, 6, 8, 10-29, 43-51, 73 or 75. 


25 
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Still another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 3 or SEQ ID NO: 3 with at least one conservative amino acid 
substitution; SEQ ID NO: 2; an isolated polynucleotide that has at least 70%, 80%, 90%, 
5 or 95% sequence identity with SEQ ID NO: 2; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ID NO: 2; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 2 and encodes a plant 
lecithinxholesterol acyltransferase-like polypeptide. 

1 0 Still another aspect provides an isolated nucleic acid sequence consisting 

essentially of a polynucleotide of the formula 5* X^^-^R^Rg^-Y 3' where X is a 
hydrogen, Y is a hydrogen or a metal, Rj and R 2 are any nucleic acid, n is an integer 
between 0-3000, and R^ is selected from the group consisting of an isolated polynucleotide 
encoding a polypeptide of SEQ ID NO: 3 or SEQ ID NO: 3 with at least one conservative 

15 amino acid substitution; SEQ ID NO: 2; an isolated polynucleotide that has at least 70%, 
80%, 90%, or 95% sequence identity with SEQ ID NO: 2; an isolated polynucleotide of at 
least 10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 2; an 
isolated polynucleotide complementary to any of the foregoing; and an isolated 
polynucleotide that hybridizes under stringent conditions to SEQ ID NO: 2 and encodes a 

2 0 plant lecithinxholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO:5 or SEQ ID NO: 5 with at least one conservative amino acid 
substitution; SEQ ID NO: 4; an isolated polynucleotide that has at least 70%, 80%, 90%, 
25 or 95% sequence identity with SEQ ID NO: 4; an isolated polynucleotide of at least 1 0 
ammo acids that hybridizes under stringent conditions to SEQ ID NO: 4; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 4 and encodes a plant 
lecithinxholesterol acyltransferase-like polypeptide. 

3 0 Another aspect provides an isolated nucleic acid sequence consisting essentially of 

a polynucleotide of the formula 5* X^R^^R^R^-Y 3' where X is a hydrogen, Y is a 
hydrogen or a metal, Ri and are any nucleic acid, n is an integer between 0-3000, and 
Rs is selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 5 or SEQ ID NO: 5 with at least one conservative amino acid 
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substitution; SEQ ID NO: 4; an isolated polynucleotide that has at least 70%, 80%, 90%, 
or 95% sequence identity with SEQ ID NO: 4; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ID NO: 4; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
5 hybridizes under stringent conditions to SEQ ID NO: 4 and encodes a plant 
lecitbinxholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO:7 or SEQ ID NO: 7 with at least one conservative amino acid 

1 0 substitution; SEQ ID NO: 6; an isolated polynucleotide that has at least 70%, 80%, 90%, 
or 95% sequence identity with SEQ ID NO: 6; an isolated polynucleotide of at least 1 0 
amino acids that hybridizes under stringent conditions to SEQ ED NO: 6; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 6 and encodes a plant 

1 5 lecithin:cholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
a polynucleotide of the formula 5* X-(R,) n -(R 2 ) n .(R,) 0 -Y 3' where X is a hydrogen, Y is a 
hydrogen or a metal, R, and R 2 are any nucleic acid, n is an integer between 0-3000, and 
R 2 is selected from the group consisting of an isolated polynucleotide encoding a 

2 0 polypeptide of SEQ ID NO: 7 or SEQ ID NO: 7 with at least one conservative amino acid 
substitution; SEQ ID NO: 6; an isolated polynucleotide that has at least 70%, 80%, 90%, 
or 95% sequence identity with SEQ ED NO: 6; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ED NO: 6; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 

2 5 hybridizes under stringent conditions to SEQ ID NO: 6 and encodes a plant 

lecithinxholesterol acyltransferase-like polypeptide. 

. Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ED NO:9 or SEQ ED NO: 9 with at least one conservative amino acid 

3 0 substitution; SEQ ED NO: 8; an isolated polynucleotide that has at least 70%, 80%, 90%, 

or 95% sequence identity with SEQ ED NO: 8; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ED NO: 8; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
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hybridizes under stringent conditions to SEQ ID NO: 8 and encodes a plant 
lecithinxholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
a polynucleotide of the formula 5* X^^-^-^-Y 3* where X is a hydrogen, Y is a 
5 hydrogen or a metal, R, and R 2 are any nucleic acid, n is an integer between 0-3000, and 
Rj is selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 9 or SEQ ID NO: 9 with at least one conservative amino acid 
substitution; SEQ ID NO: 8; an isolated polynucleotide that has at least 70%, 80%, 90%, 
or 95% sequence identity with SEQ ID NO: 8; an isolated polynucleotide of at least 10 
1 0 amino acids that hybridizes under stringent conditions to SEQ ID NO: 8; an isolated 

polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 8 and encodes a plant 
lecithinxholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
1 5 polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO:74 or SEQ ID NO: 74 with at least one conservative amino 
acid substitution; SEQ ID NO: 73; an isolated polynucleotide that has at least 70%, 80%, 
90%, or 95% sequence identity with SEQ ID NO: 73; an isolated polynucleotide of at least 
10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 73; an isolated 
2 0 polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 73 and encodes a plant 
lecithinxholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
a polynucleotide of the formula 5' X-^^R^-CR^-Y 3* where X is a hydrogen, Y is a 
2 5 hydrogen or a metal, R t and R 2 are any nucleic acid, n is an integer between 0-3000, and 
R 2 is selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 74 or SEQ ID NO: 74 with at least one conservative amino 
acid substitution; SEQ ID NO: 73; an isolated polynucleotide that has at least 70%, 80%, 
90%, or 95% sequence identity with SEQ ID NO: 73; an isolated polynucleotide of at least 
30 10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 73; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 73 and encodes a plant 
lecithinxholesterol acyltransferase-like polypeptide. 
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Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO:76 or SEQ ID NO: 76 with at least one conservative amino 
acid substitution; SEQ ID NO: 75; an isolated polynucleotide that has at least 70%, 80%, 
5 90%, or 95% sequence identity with SEQ ID NO: 75; an isolated polynucleotide of at least 
10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 75; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ED NO: 75 and encodes a plant 
lecithinxholesterol acyltransferase-like polypeptide. 
^10 Another aspect provides an isolated nucleic acid sequence consisting essentially of 

a polynucleotide of the formula 5* X-(R 1 ) ll -(R 2 ) n -(R3) B -Y 3' where X is a hydrogen, Y is a 
hydrogen or a metal, R ( and R 2 are any nucleic acid, n is an integer between 0-3000, and 
R 2 is selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 76 or SEQ ID NO: 76 with at least one conservative amino 

15 acid substitution; SEQ ID NO: 75; an isolated polynucleotide that has at least 70%, 80%, 
90%, or 95% sequence identity with SEQ ID NO: 75; an isolated polynucleotide of at least 
10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 75; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ED NO: 75 and encodes a plant 

2 0 lecithinxholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of SEQ ID NO: 42 or a degenerate 
variant thereof; an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% 
sequence identity with SEQ ID NO: 42; an isolated polynucleotide of at least 10 amino 

25 acids that hybridizes under stringent conditions to SEQ ID NO: 42; an isolated 

polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 42 and encodes an acyl 
CoAxholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 

30 a polynucleotide of the formula 5 * X^ l ) B -(R 2 ) n -<R s ) B -Y 3 ' where X is a hydrogen, Y is a 
hydrogen or a metal, R t and Rt are any nucleic acid, n is an integer between 0-3000, and 
R 2 is selected from the group consisting of SEQ ID NO: 42 or a degenerate variant thereof; 
an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% sequence identity with 
SEQ ID NO: 42; an isolated polynucleotide of at least 10 amino acids that hybridizes 
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under stringent conditions to SEQ ID NO: 42; an isolated polynucleotide complementary ' 
to any of the foregoing; and an isolated polynucleotide that hybridizes under stringent 
conditions to SEQ ID NO: 42 and encodes a acyl CoAxholesterol acyltransferase-like . 
polypeptide. 

5 Also provided is a recombinant nucleic acid construct comprising a regulatory 

sequence operably linked to a polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide and/or an acyl CoAxholesterol acyltransferase-like 
polypeptide. In one embodiment, the sterol acyl transferases are plant sterol acyl 
transferases. In another embodiment, the recombinant nucleic acid constructs further 

1 0 comprises a termination sequence. The regulatory sequence can be a constitutive 

promoter, an inducible promoter, a developmentally regulated promoter, a tissue specific 
promoter, an organelle specific promoter, a seed specific promoter or a combination of any 
of the foregoing. Also provided is a plant containing this recombinant nucleic acid 
construct and the seed and progeny from such a plant This recombinant nucleic acid 

1 5 construct can also be introduced into a suitable host cell to provide yet another aspect of 
the invention. If the host cell is a plant host cell, the cell can be used to generate a plant to 
provide another aspect of the invention. Further aspects include seed and progeny from 
such a plant. 

Yet another aspect is a purified polypeptide comprising, SEQ ID NO: 3, SEQ ID 

2 0 NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 74, SEQ ID NO: 76, or any of the 

preceding sequences with at least one conservative amino acid substitution. 

Still another aspect provides a purified immunogenic polypeptide comprising at 
least 10 consecutive amino acids from an amino acid sequence selected from the group 
consisting of SEQ ID NO: 3, 5, 7, 9, 74, 76 and any of the preceding sequences containing 
25 at least one conservative amino acid substitution. Also provided are antibodies, either 

polyclonal or monoclonal, that specifically bind the preceding immunogenic polypeptides. 

One aspect provides a method for producing a lecithinxholesterol acyltransferase- 
like polypeptide or an acyl CoAxholesterol acyltransferase-like polypeptide comprising 
culturing a host cell containing any recombinant nucleic acid construct of the present 

3 0 invention under condition permitting expression of said lecithinxholesterol 

acyltransferase-like polypeptide or acyl CoAxholesterol acyltransferase-like polypeptide. 

Another aspect provides a method for modifying the sterol content of a host cell, 
comprising transforming a host cell with a recombinant construct containing a regulatory 
sequence operably linked to a polynucleotide encoding a lecithinxholesterol 
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acyltransferase-like polypeptide and culturing said host cell under conditions wherein said 
host cell expresses a lecithinxholesterol acyltransferase-like polypeptide such that said 
host cell has a modified sterol composition as compared to host cells without the 
recombinant construct 

5 An additional aspect is a method for modifying the sterol content of a host cell 

comprising transforming a host cell with a recombinant construct containing a regulatory 
sequence operably linked to a polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide and culturing said host cell under conditions wherein said 
host cell expresses an acyl CoAxholesterol acyltransferase-like polypeptide such that said 

1 0 host cell has a modified sterol composition as compared to host cells without the 
recombinant construct. 

A further aspect is a plant comprising a recombinant construct containing a 
regulatory sequence operably linked to a polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide wherein expression of said recombinant construct results 

15 in modified sterol composition of said plant as compared to the same plant without said 
recombinant construct. 

Another aspect provides a plant comprising a recombinant construct containing a 
regulatory sequence operably linked to a polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide wherein expression of said recombinant construct results 

20 in modified sterol composition of said plant as compared to the same plant without said 
recombinant construct. 

i 

In a further aspect is provided an oil obtained from any of the plants or host cells of 
the present invention. 

In still another aspect is provided a method for producing an oil with a modified 

2 5 sterol composition comprising providing any of the plants or host cells of the present 

invention and extracting oil from the plant by any known method. Also provided is an oil 
produced by the preceding method. 

Still another aspect provides a method for altering oil production by a host cell 
comprising, transforming a host cell with a recombinant construct containing a regulatory 

3 0 sequence operably linked to a polynucleotide encoding a lecithinxholesterol 

acyltransferase-like polypeptide and culturing the host cell under conditions wherein the 
host cell expresses a lecithinxholesterol acyltransferase-like polypeptide such that the host 
cell has an altered oil production as compared to host cells without the recombinant 
construct. 
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Another aspect provides a method for altering oil production by a host cell 
comprising, transforming a host cell with a recombinant construct containing a regulatory 
sequence operably linked to a polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide and culturing the host cell under conditions wherein the 
5 host cell expresses an acyl CoAxholesterol acyltransferase-like polypeptide such that the 
host cell has an altered oil production as compared to host cells without the recombinant 
construct. 

Also provided is a plant comprising a recombinant construct containing a 
regulatory sequence operably linked to a polynucleotide encoding a lecithinxholesterol 
1 0 acyltransferase-like polypeptide wherein expression of said recombinant construct results 
in an altered production of oil by said plant as compared to the same plant without said 
recombinant construct 

In a further aspect is provided a plant comprising a recombinant construct 
containing a regulatory sequence operably linked to a polynucleotide encoding an acyl 
1 5 CoAxholesterol acyltransferase-like polypeptide wherein expression of said recombinant 
construct results in an altered production of oil by said plant as compared to the same plant 
without said recombinant construct 

Additional aspects provide a food, food ingredient or food product comprising any 
oil, plant or host cell of the present invention; a nutritional or dietary supplement 
2 0 comprising any oil, plant or host cell of the present invention; and a pharmaceutical 

composition comprising any oil, plant or host cell of the present invention along with a 
suitable diluent, carrier or excipient 

Additional aspects will be apparent from the descriptions and examples that follow. 

2 5 BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects, and advantages of the present invention will 
become bettor understood with regard to the following description, appended claims and 
accompanying figures where: 

Figure 1 shows an alignment of yeast, human and rat lecithinrcholesterol 

3 0 acyltransferase protein sequences with Arabidopsis LCAT1 , LCAT2, LCAT3, and LCAT4 

deduced amino acid sequences. 

Figure 2 shows the results of NMR sterol ester analysis on T2 seed from plant 
expressing LCAT4 under the control of the napin promoter (pCGN9998). 
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Figure 3 shows the results of HPLC/MS sterol analysis on oil extracted from T2 
seed from control lines (pCGN8640) and lines expressing LCAT3 (pCGN9968) under the 
control of the napin promoter. 

Figure 4 shows the results of HPLC/MS sterol analysis on oil extracted from T2 
5 seed from control lines (pCGN8640), and plant line expressing LCAT1 (pCGN9962), 
LCAT2 (pCGN9983), LCAT3 (pCGN9968), and LCAT4 (pCGN9998) under the control 
of the napin promoter. Additionally, data from 3 lines expressing LCAT4 under the 
control of the 35S promoter (pCGN9996) are shown. 

Figure 5 shows the results of Nir analysis of the oil content of T2 seed from control 
1 0 lines (pCGN8640), and plant lines expressing LCAT1 (pCGN9962), LCAT2 

(pCGN9983), and LCAT3 (pCGN9968) under the control of the napin promoter 
Additionally, data from 16 lines expressing LCAT2 under the control of the 35S promoter 
(pCGN9981) are shown. 

1 5 DETAILED DESCRIPTION 

The following detailed description is provided to aid those skilled in the art in 
practicing the present invention. Even so, this detailed description should not be construed 
to unduly limit the present invention as modifications and variations in the embodiments 
discussed herein can be made by those of ordinary skill in the art without departing from 
2 0 the spirit or scope of the present inventive discovery. 

All publications, patents, patent applications and other references cited in this 
application are herein incorporated by reference in their entirety as if each individual 
publication, patent, patent application or other reference were specifically and individually 
indicated to be incorporated by reference. 

2 5 The present invention relates to lecithinxholesterol acyltransferase, particularly the 

isolated nucleic acid sequences encoding lecithin:cholesterol-like polypeptides (LCAT) 
from plant sources and acyl CoA:choIesterol:acyltransferase, particularly the isolated 
nucleic acid sequences encoding acyl CoAxhoIesterol acyltransferase-like polypeptides 
(ACAT) from plant sources. Lecithinxholesterol acyltransferase-like as used herein 

3 0 includes any nucleic acid sequence encoding an amino acid sequence from a plant source, 

such as a protein, polypeptide or peptide, obtainable from a cell source, which 
demonstrates the ability to utilize lecithin (phosphatidyl choline) as an acyl donor for 
acylation of sterols or glycerides to form esters under enzyme reactive conditions along 
with such proteins polypeptides and peptides. Acyl CoAxhoIesterol acyltransferase-like 
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as used herein includes any nucleic acid sequence encoding an amino acid sequence from a ' 
plant source, such as a protein, polypeptide or peptide, obtainable from a cell source, 
which demonstrates the ability to utilize acyl Co A as an acyl donor for acylation of sterols 
or glycerides to form esters under enzyme reactive conditions along with such proteins 
5 polypeptides and peptides. By "enzyme reactive conditions" is meant that any necessary 
conditions are available in an environment (i.e., such factors as temperature, pH, lack of 
inhibiting substances) which will permit the enzyme to function. 

The term "sterol" as applied to plants refers to any chiral tetracyclic isopentenoid 
which may be formed by cyclization of squalene oxide through the transition state 

1 0 possessing stereochemistry similar to the trans-syn-trans-anti-trans-anti configuration, for 
example, protosteroid cation, and which retains a polar group at C-3 (hydroxyl or keto), an 
all-trans-anti stereochemistry in the ring system, and a side-chain 20R-configuration 
(Parker, et al (1992) In Nes, et ai, Eds., Regulation of Isopentenoid Metabolism, ACS 
Symposium Series No. 497, p. 1 10; American Chemical Society, Washington, D.C.). 

1 5 Sterols may or may not contain a C-5-C-6 double bond, as this is a feature 

introduced late in the biosynthetic pathway. Sterols contain a C 8 -C l0 side chain at the C- 17 
position. 

The term "phytosterol," which applies to sterols found uniquely in plants, refers to 
a sterol containing a C-5, and in some cases a C-22, double bond. Phytosterols are further 
20 characterized by alkylation of theC-17 side-chain with a methyl or ethyl substituent at the 
C-24 position. Major phytosterols include, but are not limited to, sitosterol, stigmasterol, 
campesterol, brassicasteroi, etc. Cholesterol, which lacks a C-24 methyl or ethyl side- 
chain, is found in plants, but is not unique thereto, and is not a "phytosterol. ,, 

<4 Phytostanols" are saturated forms of phytosterols wherein the C-5 and, when 
2 5 present, C-22 double bond(s) is (are) reduced, and include, but are not limited to, 
sitostanol, campestanol, and 22-dihydrobrassicastanol. 

"Sterol esters" are further characterized by the presence of a fatty acid or phenolic 
acid moiety rather than a hydroxyl group at the C-3 position. 

The term "sterol" includes sterols, phytosterols, phytosterol esters, phytostanols, 
30 and phytostanol esters. 

The term "sterol compounds" includes sterols, phyotsterols, phytosterol esters, 
phytostanols, and phytostanol esters. 

The term "phytosterol compound" refers to at least one phytosterol, at least one 
phytosterol ester, or a mixture thereof. 
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The term "phytostanol compound" refers to at least one phytostanol, at least one 
phytostanol ester, or a mixture thereof. 

The terra "glyceride" refers to a fatty acid ester of glycerol and includes mono-, di-, 
and tri- acylglycerols. 

5 As used herein, "recombinant construct" is defined either by its method of 

production or its structure. In reference to its method of production, e.g., a product made 
by a process, the process is use of recombinant nucleic acid techniques, e.g., involving 
human intervention in the nucleotide sequence, typically selection or production. 
Alternatively, in terms of structure, it can be a sequence comprising fusion of two or more 
10 nucleic acid sequences which are not naturally contiguous or operatively linked to each 
other 

As used herein, "regulatory sequence" means a sequence of DNA concerned with 
controlling expression of a gene; e.g. promoters, operators and attenuators. A 
"heterologous regulatory sequence" is one which differs from the regulatory sequence 
1 5 naturally associated with a gene. 

As used herein, "polynucleotide" and "oligonucleotide" are used interchangeably 
and mean a polymer of at least two nucleotides joined together by a phosphodiester bond 
and may consist of either ribonucleotides or deoxynucleo tides. 

As used herein, "sequence" means the linear order in which monomers appear in a 
2 0 polymer, for example, the order of amino acids in a polypeptide or the order of nucleotides 
in a polynucleotide. 

As used herein, "polypeptide", "peptide", and "protein" are used interchangeably 
and mean a compound that consist of two or more amino acids that are linked by means of 
peptide bonds. 

25 As used herein, the terms "complementary" or "complementarity" refer to the 

pairing of bases, purines and pyrimidines, that associate through hydrogen bonding in 
double stranded nucleic acids. For example, the following base pairs are complementary: 
guanine and cytosine; adenine and thymine; and adenine and uracil. The terms, as used 
herein, include complete and partial complementarity. 
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Isolated proteins, Polypeptides and Polynucleotides 

A first aspect of the present invention relates to isolated LCAT polynucleotides. 
The polynucleotide sequences of the present invention include isolated polynucleotides 
that encode the polypeptides of the invention having a deduced amino acid sequence 
5 selected from the group of sequences set forth in the Sequence Listing and to other 
polynucleotide sequences closely related to such sequences and variants thereof. 

The invention provides a polynucleotide sequence identical over its entire length to 
each coding sequence as set forth in the Sequence Listing. The invention also provides the 
coding sequence for the mature polypeptide or a fragment thereof, as well as the coding 

1 0 sequence for the mature polypeptide or a fragment thereof in a reading frame with other 
coding sequences, such as those encoding a leader or secretory sequence, a pre-, pro-, or 
prepro- protein sequence. The polynucleotide can also include non-coding sequences, 
including for example, but not limited to, non-coding 5' and 3' sequences, such as the 
transcribed, untranslated sequences, termination signals, ribosome binding sites, sequences 

1 5 that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence that 
encodes additional amino acids. For example, a marker sequence can be included to 
facilitate the purification of the fused polypeptide. Polynucleotides of the present 
invention also include polynucleotides comprising a structural gene and the naturally 
associated sequences that control gene expression. 

2 0 The invention also includes polynucleotides of the formula: 

wherein, at the 5' end, X is hydrogen, and at the 3' end, Y is hydrogen or a metal, R, and 
Ra are any nucleic acid residue, n is an integer between 0 and 3000, preferably between 1 
and 1000 and Rj is a nucleic acid sequence of the invention, particularly a nucleic acid 

2 5 sequence selected from the group set forth in the Sequence Listing and preferably SEQ ID 

NOs: 2, 4, 6, 8, 10-29, 33, 42-5 1 , 73 and 75. In the formula, R 2 is oriented so that its 5' 
end residue is at the left, bound to R„ and its 3' end residue is at the right, bound to R 3 . 
Any stretch of nucleic acid residues denoted by either R group, where R is greater man 1, 
may be either a heteropolymer or a homopolymer, preferably a heteropolymer. 

3 0 The invention also relates to variants of the polynucleotides described herein that 

encode for variants of the polypeptides of the.invention. Variants that are fragments of the 
polynucleotides of the invention can be used to synthesize full-length polynucleotides of 
the invention. Preferred embodiments are polynucleotides encoding polypeptide variants 
wherein 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues of a polypeptide sequence of 
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the invention are substituted, added or deleted, in any combination. Particularly preferred 
are substitutions, additions, and deletions that are silent such that they do not alter the 
properties or activities of the polynucleotide or polypeptide. 

Further preferred embodiments of the invention that are at least 50%, 60%, or 70% 
5 identical over their entire length to a polynucleotide encoding a polypeptide of the 
invention, and polynucleotides that are complementary to such polynucleotides. More 
preferable are polynucleotides that comprise a region that is at least 80% identical over its 
entire length to a polynucleotide encoding a polypeptide of the invention and 
polynucleotides that are complementary thereto. In this regard, polynucleotides at least 

1 0 90% identical over their entire length are particularly preferred, those at least 95% 
identical are especially preferred. Further, those with at least 97% identity are highly 
preferred and those with at least 98% and 99% identity are particularly highly preferred, 
with those at least 99% being the most highly preferred. 

Preferred embodiments are polynucleotides that encode polypeptides that retain 

1 5 substantially the same biological function or activity as determined by the methods 

described herein as the mature polypeptides encoded by the polynucleotides set forth in the 
Sequence Listing. 

The invention further relates to polynucleotides that hybridize to the above- 
described sequences. In particular, the invention relates to polynucleotides that hybridize 
2 0 under stringent conditions to the above-described polynucleotides. An example of 

stringent hybridization conditions is overnight incubation at 42°G in a solution comprising 
50% formamide, 5x SSC (1 50 mM NaCl, 1 5 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7,6), 5x Denhardt's solution, 10% dextran sulfate, and 20 
micrograms/milliliter denatured, sheared salmon sperm DNA, followed by washing the 

2 5 hybridization support in 0. lx SSC at approximately 65°C. Also included are 

polynucleotides that hybridize under a wash stringency of 0.1X SSC or 0.1X SSPE (at 
50°C Other hybridization and wash conditions are well known and are exemplified in 
Sambrook, et al y Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbor, NY (1 989), particularly Chapter 11. 

3 0 The invention also provides a polynucleotide consisting essentially of a 

polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence set for in the Sequence Listing under 
stringent hybridization conditions with a probe having the sequence of said polynucleotide 
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sequence or a fragment thereof; and isolating said polynucleotide sequence. Methods for ' 
screening libraries are well known in the art and can be found for example in Sambrook, et 
al. y Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, NY 
(1989), particularly Chapter 8 and Ausubel et aL, Short Protocols in Molecular Biology, 
5 3 ri ed, Wiley and Sons, 1 995, chapter 6. Nucleic acid sequences useful for obtaining such 
a polynucleotide include, for example, probes and primers as described herein and in 
particular SEQ ID NO: 2, 4, 6, 8, 10-29, 33, 42-51, 73 and 75. These sequences are 
particularly useful in screening libraries obtained from Arabidopsis, soybean and coin for 
sequences encoding lecithinxholesterol acyltransferase and lecithinxholesterol 
1 0 acyltransferase-like polypeptides and for screening libraries for sequences encoding acyl 
CoAxholesterol acyl transferase and acyl CoAxholesterol acyl transferase-like 
polypeptides. 

As discussed herein regarding polynucleotide assays of the invention, for example, 
polynucleotides of the invention can be used as a hybridization probe for RNA, cDNA, or 

1 5 genomic DNA to isolate fiill length cDNAs or genomic clones encoding a polypeptide and 
to isolate cDNA or genomic clones of other genes that have a high sequence similarity to a 
polynucleotide set forth in the Sequence Listing and in particular SEQ ID NO: 2, 4, 6, 8, 
10-29, 33, 42-51, 73 and 75. Such probes will generally comprise at least 15 bases. 
Preferably such probes will have at least 30 bases and can have at least 50 bases. 

2 0 Particularly preferred probes will have between 30 bases and 50 bases, inclusive. 

The coding region of each gene that comprises or is comprised by a polynucleotide 
sequence set forth in the Sequence Listing may be isolated by screening using a DNA 
sequence provided in the Sequence Listing to synthesize an oligonucleotide probe. A 
labeled oligonucleotide having a sequence complementary to that of a gene of the 

2 5 invention is then used to screen a library of cDNA, genomic DNA or mRNA to identify 

members of the library which hybridize to the probe. For example, synthetic 
oligonucleotides are prepared which correspond to the LCAT EST sequences. The 
oligonucleotides are used as primers in polymerase chain reaction (PGR) techniques to 
obtain 5' and 3* terminal sequence of LCAT genes. Alternatively, where oligonucleotides 

3 0 of low degeneracy can be prepared from particular LCAT peptides, such probes may be 

used directly to screen gene libraries for LCAT gene sequences. In particular, screening of 
cDNA libraries in phage vectors is useful in such methods due to lower levels of 
background hybridization. 
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Typically, a LCAT sequence obtainable from the use of nucleic acid probes will 
show 60-70% sequence identity between the target LCAT sequence and the encoding 
sequence used as a probe. However, lengthy sequences with as little as 50-60% sequence 
identity may also be obtained. The nucleic acid probes may be a lengthy fragment of the 
5 nucleic acid sequence, or may also be a shorter, oligonucleotide probe. When longer 
nucleic acid fragments are employed as probes (greater than about 100 bp), one may 
screen at lower stringencies in order to obtain sequences from the target sample which 
have 20-50% deviation (i.e., 50-80% sequence homology) from the sequences used as 
probe. Oligonucleotide probes can be considerably shorter than the entire nucleic acid 

1 0 sequence encoding an LCAT enzyme, but should be at least about 10, preferably at least 
about 1 5, and more preferably at least about 20 nucleotides. A higher degree of sequence 
identity is desired when shorter regions are used as opposed to longer regions. It may thus 
be desirable to identify regions of highly conserved amino acid sequence to design 
oligonucleotide probes for detecting and recovering other related LCAT genes. Shorter 

15 probes are often particularly useful for polymerase chain reactions (PCR), especially when 
highly conserved sequences can be identified. (See, Gould, et al, PNAS USA (1989) 
£5:1934-1938.). 

Another aspect of the present invention relates to LCAT polypeptides. Such 
polypeptides include isolated polypeptides set forth in the Sequence Listing, as well as 
2 0 polypeptides and fragments thereof; particularly those polypeptides which exhibit LCAT 
activity and also those polypeptides which have at least 50%, 60% or 70% identity, • 
preferably at least 80% identity, more preferably at least 90% identity, and most preferably 
at least 95% identity to a polypeptide sequence selected from the group of sequences set 
forth in the Sequence Listing, and also include portions of such polypeptides, wherein 

2 5 such portion of the polypeptide preferably includes at least 30 amino acids and more 

preferably includes at least 50 amino acids. 

"Identity", as is well understood in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by 
comparing the sequences. In the art, "identity" also means the degree of sequence 

3 0 relatedness between polypeptide or polynucleotide sequences, as determined by the match 

between strings of such sequences. "Identity'* can be readily calculated by known methods 
including, but not limited to, those described in Computational Molecular Biology, Lesk, 
A.M., ed., Oxford University Press, New York (1988); Biocomputing: Informatics and 
Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; Computer Analysis 
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of Sequence Data, Part I f Griffin, A.M. and Griffin, H.G., eds., Humana Press, New Jersey' 
(1994); Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press (1987); 
Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., Stockton Press, New 
York (1991); and Carillo, R, and Lipman, D., SIAM J Applied Math, 48:1073 (1988). 
5 Methods to detennine identity are designed to give the largest match between the 
sequences tested. Moreover, methods to determine identity are codified in publicly 
available programs. Computer programs which can be used to determine identity between 
two sequences include, but are not limited to, GCG (Devereux, J., et al., Nucleic Acids 
Research 12(1):387 (1984); suite of five BLAST programs, three designed for nucleotide 

1 0 sequences queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein 
sequence queries (BLASTP and TBLASTN) (Cbulson, Trends in Biotechnology, 12: 16- 
80 (1994); Birren, et al, Genome Analysis, 1; 543-559 (1997)). The BLAST X program is 
publicly available from NGBI and other sources {BLAST Manual, Altschul, S., et al $ 
NCBI NLM NIH, Bethesda, MD 20894; Altschul, S M et al, 1 Mol Biol, 215:403-410 

1 5 (1990)). The well known Smith Waterman algorithm can also be used to detennine 
identity. 

Parameters for polypeptide sequence comparison typically include the following: 
Algorithm: Needleman and Wunsch, 1 Mol Biol 48:443-453 (1970) 
Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. Natl 
20 Acad. Sci USA 89:10915-10919 (1992) 
Gap Penalty: 12 
Gap Length Penalty: 4 

A program which can be used with these parameters is publicly available as the 
"gap" program from Genetics Computer Group, Madison Wisconsin. The above 
2 5 parameters along with no penalty for end gap are the default parameters for peptide 
comparisons. 

Parameters for polynucleotide sequence comparison include the following: 
Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970) 
Comparison matrix: matches = +10; mismatches = 0 
30 Gap Penalty: 50 

Gap Length Penalty: 3 

A program which can be used with these parameters is publicly available as the 
"gap" program from Genetics Computer Group, Madison Wisconsin. The above 
parameters are the default parameters for nucleic acid comparisons. 
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The invention also includes polypeptides of the formula: 
X-(R,) n -(R 2 )-{R3VY 

wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is 
hydrogen or a metal, R, and R 3 are any amino acid residue, n is an integer between 0 and 
5 1 000, and R 2 is an amino acid sequence of the invention, particularly an amino acid 

sequence selected from the group set forth in the Sequence Listing and preferably SEQ ID 
NOs: 3, 5, 7, 9, 74 arid 76. In the formula, Rj is oriented so that its amino terminal residue 
is at the left, bound to R„ and its carboxy terminal residue is at the right, bound to R 3 . 
Any stretch of amino acid residues denoted by either R group, where R is greater than 1 , 
1 0 may be either a heteropolymer or a homopolymer, preferably a heteropolymer. 

Polypeptides of the present invention include isolated polypeptides encoded by a 
polynucleotide comprising a sequence selected from the group of a sequence contained in 
SEQ ID NOs: 2, 4, 6, 8, 73 and 75. 

The polypeptides of the present invention can be mature protein or can be part of a 
15 fusion protein. 

Fragments and variants of the polypeptides are also considered to be a part of the 
invention. A fragment is a variant polypeptide which has an amino acid sequence that is 
entirely the same as part but not all of the amino acid sequence of the previously described 
polypeptides. The fragments can be "free-standing" or comprised within a larger 
2 0 polypeptide of which the fragment forms a part or a region, most preferably as a single 
continuous region. Preferred fragments are biologically active fragments which are those 
fragments that mediate activities of the polypeptides of the invention, including those with 
similar activity or improved activity or with a decreased activity. Also included are those 
polypeptides and polypeptide fragments that are antigenic or immunogenic in an animal, 
2 5 particularly a human and antibodies, either polyclonal or monoclonal that specifically bind 
the antigenic fragments. In one preferred embodiment, such antigenic or immunogenic 
fragments comprise at least 10 consecutive amino acids from the amino acid sequences 
disclosed herein or such sequences with at least one conservative amino acid substitution. 
In additional embodiments, such antigenic or immunogenic fragments comprise at least 
30 1 5, at least 25, at least 50 or at least 100 consecutive amino acids from the amino acid 
sequences disclosed herein or such sequences with at least one conservative amino acid 
substitution. Methods for the production of antibodies from polypeptides and 
polypeptides conjugated to carrier molecules are well known in the art and can be found 
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for example in Ausubel et al., Short Protocols in Molecular Biology, 3 rf e&, Wiley & 
Sons, 1 995, particularly chapter 11. 

Variants of the polypeptide also include polypeptides that vary from the sequences 
set forth in the Sequence Listing by conservative amino acid substitutions, substitution of 
5 a residue by another with like characteristics. Those of ordinary skill in the art are aware 
that modifications in the amino acid sequence of a peptide, polypeptide, or protein can 
result in equivalent, or possibly improved, second generation peptides, etc., that display 
equivalent or superior functional characteristics when compared to the original amino acid 
sequence. The present invention accordingly encompasses such modified amino acid 
10 sequences. Alterations can include amino acid insertions, deletions, substitutions, 

truncations, fusions, shuffling of subunit sequences, and the like, provided that the peptide 
sequences produced by such modifications have substantially the same functional 
properties as the naturally occurring counterpart sequences disclosed herein. 

One factor that can be considered in making such changes is the hydropathic index 
15 of amino acids. The importance of the hydropathic amino acid index in conferring 

interactive biological function on a protein has been discussed by Kyte and Doolittle ( X 
Mol BioL, 157: 105-132, 1982). It is accepted that the relative hydropathic character of 
amino acids contributes to the secondary structure of the resultant protein. This, in turn, 
affects the interaction of the protein with molecules such as enzymes, substrates, receptors, 

2 0 DNA, antibodies, antigens, etc. 

Based on its hydrophobicity and charge characteristics, each amino acid has been 
assigned a hydropathic index as follows: isoleucine (+4.5); valine (+4.2); leucine (+3.8); 
phenylalanine (+2.8); cysteinefcystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (- 
0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); 
2 5 histidine (-3.2); glutamate/glutamine/aspartate/asparagine (-3.5); lysine (-3.9); and 
arginine (-4.5). 

As is known in the art, certain amino acids in a peptide or protein can be 
substituted for other amino acids having a similar hydropathic index or score and produce 
a resultant peptide or protein having similar biological activity, i.e., which still retains 

3 0 biological functionality. In making such changes, it is preferable that amino acids having 

hydropathic indices within ±2 are substituted for one another. More preferred 
substitutions are those wherein the amino acids have hydropathic indices within ±1. Most 
preferred substitutions are those wherein the amino acids have hydropathic indices within 
±05. 
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Like amino acids can also be substituted on the basis of hydrophilicity; U S. Patent 
No. 4,554,101 discloses that the greatest local average hydrophilicity of a protein, as 
governed by the hydrophilicity of its adjacent amino acids, correlates with a biological 
property of the protein. The following hydrophilicity values have been assigned to amino 
5 acids: arginine/lysine (+3.0); aspartate/glutamate (+3.0 ±1); serine (+0.3); 
asparagine/glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ±1); 
alaninetfiistidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine/isoleucine 
(-1.8); tyrosine (-2.3); phenylalanine (-2.5); and tryptophan (-3.4). Thus, one amino acid 
in a peptide, polypeptide, or protein can be substituted by another amino acid having a 

1 0 similar hydrophilicity score and still produce a resultant protein having similar biological 
activity, i.e., still retaining correct biological function. In making such changes, amino 
acids having hydropathic indices within ±2 are preferably substituted for one another, 
those within ±1 are more preferred, and those within ±0.5 are most preferred. 

As outlined above, amino acid substi tutions in the peptides of the present invention 

15 can be based on the relative similarity of the amino acid side-chain substituents, for 

example, their hydrophobicity, hydrophilicity, charge, size, etc. Exemplary substitutions 
that take various of the foregoing characteristics into consideration in order to produce 
conservative amino acid changes resulting in silent changes within the present peptides, 
etc., can be selected from other members of the class to which the naturally occurring 

2 0 amino acid belongs. Amino acids can be divided into the following four groups: (1) acidic 
amino acids; (2) basic amino acids; (3) neutral polar amino acids; and (4) neutral non- 
polar amino acids. Representative amino acids within these various groups include, but 
are not limited to: (I) acidic (negatively charged) amino acids such as aspartic acid and 
glutamic acid; (2) basic (positively charged) amino acids such as arginine, histidine, and 

2 5 lysine; (3) neutral polar amino acids such as glycine, serine, threonine, cysteine, cystine, 

tyrosine, asparagine, and glutamine; and (4) neutral non-polar amino acids such as alanine, 
leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine. It should 
be noted that changes which are not expected to be advantageous can also be useful if 
these result in the production of functional sequences. 

3 0 Variants that are fragments of the polypeptides of the invention can be used to 

produce the corresponding full length polypeptide by peptide synthesis. Therefore, these 
variants can be used as intermediates for producing the full-length polypeptides of the 
invention. 
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The polynucleotides and polypeptides of the invention can be used, for example, in 
the transformation of host cells, such as plant cells, animal cells, yeast cells, bacteria, 
bacteriophage, and viruses, as further discussed herein. 

The invention also provides polynucleotides that encode a polypeptide that is a 
5 mature protein plus additional amino or carboxyl-tenninal amino acids, or amino acids 
within the mature polypeptide (for example, when the mature form of the protein has more 
than one polypeptide chain). Such sequences can, for example, play a role in the 
processing of a protein from a precursor to a mature form, allow protein transport, shorten 
or lengthen protein half-life, or facilitate manipulation of the protein in assays or 
1 0 production. It is contemplated that cellular enzymes can be used to remove any additional 
amino acids from the mature protein. 

A precursor protein, having the mature form of the polypeptide fused to one or 
more prosequences may be an inactive form of the polypeptide. The inactive precursors 
generally are activated when the prosequences are removed. Some or all of the 
1 5 prosequences may be removed prior to activation. Such precursor protein are generally 
called proproteins. 


Preparation of Expression Constructs and Methods of Use 

Of interest is the use of the nucleotide sequences in recombinant DNA constructs 
to direct the transcription or transcription and translation (expression) of the 
acyltransferase sequences of the present invention in a host cell. Of particular interest is 
the use of the polynucleotide sequences of the present invention in recombinant DNA 
constructs to direct the transcription or transcription and translation (expression) of the 
acyltransferase sequences of the present invention in a host plant cell. 

The expression constructs generally comprise a regulatory sequence functional in a 
host cell operably linked to a nucleic acid sequence encoding a lecithinxholesterol 
acyltransferase-like polypeptide or acyl CoAxholesterbl acyltransferase-like polypeptide 
of the present invention and a transcriptional termination region functional in a host plant 
cell. Of particular interest is the use of promoters (also referred to as transcriptional 
30 initiation regions) functional in plant host cells. 

Those skilled in the art will recognize that there are a number of promoters which 
are functional in plant cells, and have been described in the literature including 
constitutive, inducible, tissue specific, organelle specific, developmentally regulated and 
environmentally regulated promoters. Chloroplast and plastid specific promoters, 
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chloroplast or plastid functional promoters, and chloroplast or plastid operable promoters 
are also envisioned. 

One set of promoters are constitutive promoters such as the CaMV35S or FMV35S 
promoters that yield high levels of expression in most plant organs. Enhanced or 
5 duplicated versions of the CaMV35S and FMV35S promoters are useful in the practice of 
this invention (Odell, et al (1985) Nature 3 1 3:810-812; Rogers, US. Patent Number 
5,378, 619). Other useful constitutive promoters include, but are not limited to, the 
mannopine synthase (mas) promoter, the nopaline synthase (nos) promoter, and the 
octopine synthase {pes) promoter. 
1 0 Useful inducible promoters include heat-shock promoters (Ou-Lee et al. (1 986) 

Proc. Natl Acad. Set USA 83: 6815; Ainley et al. (1990) Plant MoL Biol 14: 949), a 
nitrate-ihducible promoter derived from the spinach nitrite reductase gene (Back et al. 
(1991) Plant MoL Biol 17: 9), hormone-inducible promoters (Yamaguchi-Shinozaki e t al. 
(1990) Plant MoL Biol 15: 905; Kares et al (1990) Plant MoL Biol 15: 905), and 

1 5 light-inducible promoters associated with the small subunit of RuBP carboxylase and 
LHCP gene families (Kuhlemeier et aL (1989) Plant Cell 1 : 471; Feinbaum et al. (1991) 
MoL Gen: Genet 226: 449; Weisshaar et al. (1991) EMBOJ 10: 1777; Lam and Chua 
(1990) Science 248: 471; Castresana et al. (1988) EMBOJ 7: 1929; Schulze-Lefert et al. 
(1989) EMBOJ 8:651). 

20 In addition, it may also be preferred to bring about expression of the 

acyltransferase gene in specific tissues of the plant, such as leaf, stem, root, tuber, seed, 
fruit, etc., and the promoter chosen should have die desired tissue and developmental 
specificity. Examples ofuseful tissue-specific, developmentally-regulated promoters 
include fruit-specific promoters such as the E4 promoter (Cordes et al. (1989) Plant Cell 

25 1:1025), the E8 promoter (Deikman et al. (1988) EMBOJ. 7: 3315), the kiwifruit 

actinidin promoter (Lin et al. (1993) PNAS 90: 5939), the 2A1 1 promoter (Hoiick et al., 
U.S. Patent 4,943,674), and the tomato pZ130 promoter (U.S. Patents 5,175, 095 and 
5,530,185); the 0-conglycinin 7S promoter (Doyle et al. (1986) J Biol Chem. 261: 9228; 
Slighton and Beachy (1987) Planta 172: 356), and seed-specific promoters (Knutzon et aL 

30 (1992) Proc. Natl Acad. Set. USA 89: 2624; Bustos et al. (1991) EMBOJ. 10: 1469;Lam 
and Chua (1991) J.Biol Chem.266: 17131; Stayton etal. (1991) Aust J Plant Physiol 
18: 507). Fruit-specific gene regulation is discussed in U.S. Patent 5,753,475. Other 
useful seed-specific promoters include, but are not limited to, the napin, phaseolin, zein, 
soybean trypsin inhibitor, 7S, ADR12, ACP, stearoyl-ACP desaturase, oleosin, 
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Lasquerella hydroxylase, and barley aldose reductase promoters (Bartels (1995) Plant J. 7: " 
809-822), the EA9 promoter (U.S. Patent 5,420,034), and the Bce4 promoter (U.S. Patent 
5,530,194). Useful embryo-specific promoters include the com globulin 1 and oleosin 
promoters. Useful endosperm-specific promoters include the rice glutelin-1 promoter, the 
5 promoters for the low-pi p amylase gene (Amy32b) (Rogers et al. (1984) J. Biol Chern. 
259: 12234), the high-pi p amylase gene (Amy 64) (Khurseed et al. (1988) 1 Biol Chem. 
263: 18953), and the promoter for a barley thiol protease gene ("AleurauO (Whittier et al. 
(1987) Nucleic Acids Res, 15:2515). 

Of particular interest is the expression of the nucleic acid sequences of the present 

1 0 invention from transcription initiation regions which are preferentially expressed in a plant 
seed tissue. Examples of such seed preferential transcription initiation sequences include 
those sequences derived from sequences encoding plant storage protein genes or from 
genes involved in fatty acid biosynthesis in oilseeds. Examples of such promoters include 
the 5' regulatory regions from such genes as napin (Kridl et al, Seed Set Res. 7:209:219 

15 (1991)), phaseolin, zein, soybean trypsin inhibitor, ACP, stearoyl-ACP desaturase, 

soybean a* subunit of P-conglycinin (soy 7s, (Chen et al, Proc. Natl Acad. ScL, 83:8560- 
8564 (1986))) and oleosin. Seed-specific gene regulation is discussed in EP 0 255 378 Bl 
and U.S. Patents 5,420,034 and 5,608,152 . Promoter hybrids can also be constructed to 
enhance transcriptional activity (Hoffman, U.S. Patent No. 5,106,739), or to combine 

2 0 desired transcriptional activity and tissue specificity. 

It may be advantageous to direct the localization of proteins conferring LCAT to a 
particular subcellular compartment, for example, to the mitochondrion, endoplasmic 
reticulum, vacuoles, chloroplast or other plastidic compartment. For example, where the 
genes of interest of the present invention will be targeted to plastids, such as chloroplasts, 

2 5 for expression, the constructs will also employ the use of sequences to direct the gene to 

the plastid. Such sequences are referred to herein as chloroplast transit peptides (CTP) or 
plastid transit peptides (PTP). In this manner, where the gene of interest is not directly 
inserted into the plastid, the expression construct will additionally contain a gene encoding 
a transit peptide to direct the gene of interest to the plastid. The chloroplast transit 

3 0 peptides may be derived from the gene of interest, or may be derived from a heterologous 

sequence having a CTP. Such transit peptides are known in the art. See, for example, Von 
Heijne et al (1991) Plant Mol Biol Rep. 9:104-126; Clark et al. (1989) Biol Chem. 
264:17544-17550; della-Cioppa et al (1987) Plant Physiol <W:965-968; Romer et al 
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(1993) Biochem. Biophys. Res Commun. 796:1414-1421; and, Shah et al (1986) Science 
233:478-481. 

Depending upon the intended use, the constructs may contain the nucleic acid 
sequence which encodes the entire LCAT protein, a portion of the LCAT protein, the 
5 entire AC AT protein, or a portion of the ACAT protein. For example, where antisense 
inhibition of a given LCAT or ACAT protein is desired, the entire sequence is not 
required. Furthermore, where LCAT or ACAT sequences used in constructs are intended 
for use as probes, it may be advantageous to prepare constructs containing only a 
particular portion of a LCAT or ACAT encoding sequence, for example a sequence which 
10 is discovered to encode a highly conserved region. 

The skilled artisan will recognize that there are various methods for the inhibition 
of expression of endogenous sequences in a host cell. Such methods include, but are not 
limited to antisense suppression (Smith, et al (1988) Nature 334:724-726) , co- 
suppression (Napoli, et al (1989) Plant Cell 2:279-289), ribozymes (PCT Publication 
15 WO 97/10328), and combinations of sense and antisense Waterhouse, et al (1998) Proc. 
NatL Acad, ScL USA 95:13959-13964. Methods for the suppression of endogenous 
sequences in a host cell typically employ the transcription or transcription and translation 
of at least a portion of the sequence to be suppressed. Such sequences may be homologous 
to coding as well as non-coding regions of the endogenous sequence. 
2 0 Regulatory transcript termination regions may be provided in plant expression 

constructs of this invention as well. Transcript termination regions may be provided by 
the DNA sequence encoding the diacylglycerol acyltransferase or a convenient 
transcription termination region derived from a different gene source, for example, the 
transcript termination region which is naturally associated with the transcript initiation 

2 5 region. The skilled artisan will recognize that any convenient transcript termination region 

which is capable of teiminating transcription in a plant cell may be employed in the 
constructs of the present invention. 

Alternatively, constructs may be prepared to direct the expression of the LCAT or 
ACAT sequences directly from the host plant cell plastid. Such constructs and methods 

3 0 are known in the art and are generally described, for example, in Svab, et al (1990) Proc. 

Natl Acad. ScL USA 87:8526-8530 and Svab and Maliga<1993) Proa Natl Acad. ScL 
USA 90:913-917 and in U.S. Patent Number 5,693,507. 

A plant cell, tissue, organ, or plant into which the recombinant DNA constructs 
containing the expression constructs have been introduced is considered transformed, 
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transfected, or transgenic. A transgenic or transformed cell or plant also includes progeny 
of the cell or plant and progeny produced from a breeding program employing such a 
transgenic plant as a parent in a cross and exhibiting an altered phenotype resulting from 
the presence of a LCAT nucleic acid sequence. 
5 Plant expression or transcription constructs having a plant LCAT as the DNA 

sequence of interest for increased or decreased expression thereof may be employed with a 
wide variety of plant life, particularly, plant life involved in the production of vegetable 
oils for edible and industrial uses. Most especially preferred are temperate oilseed crops. 
Plants of interest include, but are not limited to, rapeseed (Canola and High Erucic Acid 

1 0 varieties), sunflower, safflower, cotton, soybean, peanut, coconut and oil palms, and corn. 
Depending on the method for introducing the recombinant constructs into the host cell, 
other DNA sequences may be required. Importantly, this invention is applicable to 
dicotyledyons and monocotyledons species alike and will be readily applicable to new 
and/or improved transformation and regulation techniques. 

1 5 Of particular interest, is the use of plant LCAT and ACAT constructs in plants to 

produce plants or plant parts, including, but not limited to leaves, stems, roots, 
reproductive, and seed, with a modified content of lipid and/or sterol esters and to alter the 
oil production by such plants. 

Of particular interest in the present invention, is the use of ACAT genes in 

2 0 conjunction with the LCAT sequences to increase the sterol content of seeds. Thus, 

overexpression of a nucleic acid sequence encoding an ACAT and LCAT in an oilseed 
crop may find use in the present invention to increase sterol levels in plant tissues and/or 
increase oil production. 

It is contemplated that the gene sequences may be synthesized, either completely or 
25 in part, especially where it is desirable to provide plant-preferred sequences. Thus, all or a 
portion of the desired structural gene (that portion of the gene which encodes the LCAT or 
ACAT protein) may be synthesized using codons preferred by a selected host Host- 
preferred codons may be determined, for example, from the codons used most frequently 
in the proteins expressed in a desired host species. 

3 0 One skilled in the art will readily recognize that antibody preparations, nucleic acid 

probes (DNA and RNA) and the like may be prepared and used to screen and recover 
"homologous" or "related" sequences from a variety of plant sources. Homologous 
sequences are found when there is an identity of sequence, which may be determined upon 
comparison of sequence information, nucleic acid or amino acid, or through hybridization 
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reactions between a known LCAT and a candidate source. Conservative changes, such as 
GIu/Asp, Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be considered in determining 
sequence homology. Amino acid sequences are considered homologous by as little as 
25% sequence identity between the two complete mature proteins. (See generally, 
5 Doolittle, R.F., OF URFS and ORFS (University Science Books, CA, 1986.) 

Thus, other LCATs may be obtained from the specific sequences provided herein. 
Furthermore, it will be apparent that one can obtain natural and synthetic sequences, 
including modified amino acid sequences and starting materials for synthetic-protein 
modeling from the exemplified LCAT and ACAT sequences and from sequences which 
10 are obtained through the use of such exemplified sequences. Modified amino acid 

sequences include sequences which have been mutated, truncated, increased and the like, 
whether such sequences were partially or wholly synthesized. Sequences which are 
actually purified from plant preparations or are identical or encode identical proteins 
thereto, regardless of the method used to obtain the protein or sequence, are equally 
1 5 considered naturally derived. 

For immunological screening, antibodies to the protein can be prepared by 
injecting rabbits or mice with the purified protein or portion thereof, such methods of 
preparing antibodies being well known to those in the art Either monoclonal or 
polyclonal antibodies can be produced, although typically polyclonal antibodies are more 
2 0 useful for gene isolation. Western analysis may be conducted to determine that a related 
protein is present in a crude extract of the desired plant species, as determined by cross- 
reaction with the antibodies to the encoded proteins. When cross-reactivity is observed, 
genes encoding the related proteins are isolated by screening expression libraries 
representing the desired plant species. Expression libraries can be constructed in a variety 

2 5 of commercially available vectors, including lambda gt 1 1, as described in Sambrook, et al 

(Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York). 

To confirm the activity and specificity of the proteins encoded by the identified 
nucleic acid sequences as acyltransferase enzymes, in vitro assays are performed in insect 

3 0 cell cultures using baculovirus expression systems. Such baculovirus expression systems 

are known in the art and are described by Lee, et al U.S. Patent Number 5,348,886, the 
entirety of which is herein incorporated by reference. 

In addition, other expression constructs may be prepared to assay for protein 
activity utilizing different expression systems. Such expression constructs are transformed 
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into yeast or prokaryotic host and assayed for acyltransferase activity. Such expression 
systems are known in the art and are readily available through commercial sources. 

The method of transformation in obtaining such transgenic plants is not critical to 
the instant invention, and various methods of plant transformation are currently available. 
5 Furthermore, as newer methods become available to transform crops, they may also be 
directly applied hereunder. For example, many plant species naturally susceptible to 
Agrobacterium infection may be successfully transformed via tripartite or binary vector 
methods of Agrobacterium mediated transformation. In many instances, it will be 
desirable to have the construct bordered on one or both sides by T-DNA, particularly 

1 0 having the left and right borders, more particularly the right border. This is particularly 
useful when the construct uses A, tumefaciens or A. rhizogenes as a mode for 
transformation, although the T-DNA borders may find use with other modes of 
transformation. In addition, techniques of microinjection, DNA particle bombardment, 
and electroporation have been developed which allow for the transformation of various 

1 5 monocot and dicot plant species. 

Normally, included with the DNA construct will be a structural gene having the 
necessary regulatory regions for expression in a host and providing for selection of 
transformant cells. The gene may provide for resistance to a cytotoxic agent, e.g. 
antibiotic, heavy metal, toxin, etc., complementation providing prototrophy to an 

2 0 auxotrophic host, viral immunity or the like. Depending upon the number of different host 
species the expression construct or components thereof are introduced, one or more 
markers may be employed, where different conditions for selection are used for the 
different hosts. 

Non-limiting examples of suitable selection markers include genes that confer 
2 5 resistance to bleomycin, gentamycin, glyphosate, hygromycin, kanamycin, methotrexate, 
phleomycin, phosphinotricin, spectinomycin, streptomycin, sulfonamide and 
sulfonylureas. Maliga et aL, Methods in Plant Molecular Biology^ Cold Spring Harbor 
Laboratory Press, 1995, p. 39. Examples of markers include, but are not limited to, 
alkaline phosphatase (AP), myc, hemagglutinin (HA), p glucuronidase (GUS), luciferase, 
30 and green fluorescent protein (GFP). 

Where Agrobacterium is used for plant cell transformation, a vector may be used 
which may be introduced into the Agrobacterium host for homologous recombination with 
T-DNA or the Ti- or Ri-plasmid present in the Agrobacterium host. The Ti- or Ri-plasmid 
containing the T-DNA for recombination may be armed (capable of causing gall 
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formation) or disarmed (incapable of causing gall formation), the latter being permissible, 
so long as the vir genes are present in the transformed Agrobacterium host The armed 
plasmid can give a mixture of normal plant cells and gall. 

In some instances where Agrobacterium is used as the vehicle for transforming 
5 host plant cells, the expression or transcription construct bordered by the T-DNA border 
region(s) will be inserted into a broad host range vector capable of replication in E. coli 
and Agrobacterium, there being broad host range vectors described in the literature. 
Commonly used is pRK2 or derivatives thereof See, for example, Ditta, et al. y (Proc. NaL 
Acad. ScL, US.A. (1980) 77:7347.7351) and EPA 0 120 515, which are incorporated 

1 0 herein by reference. Alternatively, one may insert the sequences to be expressed in plant 
cells into a vector containing separate replication sequences, one of which stabilizes the 
vector in E. coli, and the other in Agrobacterium. See, for example, McBride and 
Summerfelt {Plant Mol Biol (1990) 74:269-276), wherein the pRiHRI (Jouanin, et al, 
Mol Gen. Genet (1985) 201:370-374) origin of replication is utilized and provides for 

1 5 added stability of the plant expression vectors in host Agrobacterium cells. 

Included with the expression construct and the T-DNA can be one or more 
markers, which allow for selection of transformed Agrobacterium and transformed plant 
cells. A 

number of markers have been developed for use with plant cells, such as resistance to 
2 0 chloramphenicol, kanamycin, the aminoglycoside G41 8, hygromycin, or the like. The 

particular marker employed is not essential to this invention, one or another marker being 
preferred depending on the particular host and the manner of construction. 

For transformation of plant cells using Agrobacterium, explants may be combined 
and incubated with the transformed Agrobacterium for sufficient time for transformation, 

2 5 the bacteria killed, arid the plant cells cultured in an appropriate selective medium. Once 

callus forms, shoot formation can be encouraged by employing the appropriate plant 
hormones in accordance with known methods and the shoots transferred to rooting 
medium for regeneration of plants. The plants may then be grown to seed and the seed 
used to establish repetitive generations and for isolation of vegetable oils. 

3 0 Thus, in another aspect of the present invention, methods for modifying the sterol 

and/or stanol composition of a host cell. Of particular interest are methods for modifying 
the sterol and/or stanol composition of a host plant cell. In general the methods involve 
either increasing the levels of sterol ester compounds as a proportion of the total sterol 
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compounds. The method generally comprises the use of expression constructs to direct the " 
expression of the polynucleotides of the present invention in a host cell 

Also provided are methods for reducing the proportion of sterol ester compounds 
as a percentage of total sterol compounds in a host plant cell. The method generally 
5 comprises the use of expression constructs to direct the suppression of endogenous 
acyltransferase proteins in a host cell 

Of particular interest is the use of expression constructs to modify the levels of 
sterol compounds in a host plant cell. Most particular, the methods find use in modifying 
the levels of sterol compounds in seed oils obtained from plant seeds. 
1 0 Also of interest is the use of expression constructs of the present invention to alter 

oil production in a host cell and in particular to increase oil production. Of particular 
interest is the use of expression constructs containing nucleic acid sequences encoding 
LCAT and/or ACAT polypeptides to transform host plant cells and to use these host cells 
to regenerate whole plants having increase oil production as compared to the same plant 
1 5 not containing the expression construct. 

The oils obtained from transgenic plants having modified sterol compound content 
find use in a wide variety of applications. Of particular interest in the present invention is 
the use of the oils containing modified levels of sterol compounds in applications involved 
in improving human nutrition and cardiovascular health. For example, phytostanols are 

2 0 beneficial for lowering serum cholesterol (Ling, et ah (1995) Life Sciences 57:195-206). 

Cholesterol-lowering compositions comprise the oils and sterol ester compound 
compositions obtained using the methods of the present invention. Such cholesterol 
lowering compositions include, but are not limited to foods, food products, processed 
foods, food ingredients, food additive compositions, or dietary/nutritional supplements 
25 that contain oils and/or fats. Non-limiting examples include margarines; butters; 
shortenings; cooking oils; frying oils; dressings, such as salad dressings; spreads; 
mayonnaises; and vitamin/mineral supplements. Patent documents relating to such 
compositions include, U.S. Patents 4,588,717 and 5,244,887, and PCT International 
Publication Nos. WO 96/38047, WO 97/42830, WO 98/06405, and WO 98/06714. 

3 o Additional non-limiting examples include toppings; dairy products such as cheese and 

processed cheese; processed meat; pastas; sauces; cereals; desserts, including frozen and 
shelf-stable desserts; dips; chips; baked goods; pastries; cookies; snack bars; 
confections; chocolates; beverages; nnextracted seed; and unextracted seed that has been 
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ground, cracked, milled, rolled, extruded, pelleted, defatted, dehydrated, or otherwise 
processed, but which still contains the oils, etc., disclosed herein. 

The cholesterol-lowering compositions can also take the form of pharmaceutical 
compositions comprising a cholesterol-lowering effective amount of the oils or sterol 
5 compound compositions obtained using the methods of the present invention, along with a 
pharmaceutically acceptable carrier, excipient, or diluent. These pharmaceutical 
compositions can be in the form of a liquid or a solid. Liquids can be solutions or 
suspensions; solids can be in the form of a powder, a granule, a pill, a tablet, a gel, or an 
extrudate. US. Patent 5,270,041 relates to sterol-containing pharmaceutical compositions. 

1 0 Thus, by expression of the nucleic acid sequences encoding acyltransferase-like 

sequences of the present invention in a host cell, it is possible to modify the lipid content 
and/or composition as well as the sterol content and/or composition of the host cell. 

The invention now being generally described, it will be more readily understood by 
reference to the following examples which are included for purposes of illustration only 

1 5 and are not intended to limit the present invention. 

EXAMPLES 

Example 1: RNA Isolations 

Total RNA from the inflorescence and developing seeds of Arabidopsis thaliana 

20 was isolated for use in construction of complementary (cDNA) libraries. The procedure 
was an adaptation of the DNA isolation protocol of Webb and Knapp (D.M. Webb and S J. 
Knapp, (1990) Plant Molec. Reporter, 8, 180-185). The following description assumes the 
use of lg fresh weight of tissue. Frozen seed tissue was powdered by grinding under 
liquid nitrogen. The powder was added to 10ml REC buffer (50mM Tris-HCl, pH 9, 0.8M 

25 NaCl> lOmM EDTA, 0.5% w/v CTAB (cetyltrimethyl-ammonium bromide)) along with 
0.2g insoluble polyvinylpolypyrrolidone, and ground at room temperature. The 
homogenate was centrifuged for 5 minutes at 12,000 xg to pellet insoluble material. The 
resulting supernatant fraction was extracted with chloroform, and the top phase was 
recovered. 

3 0 The RNA was then precipitated by addition of 1 volume RecP (50mM Tris-HCL 

pH9, lOmM EDTA and 0.5% (w/v) CTAB) and collected by brief centrifiigation as before. 
The RNA pellet was redissolved in 0.4 ml of 1M NaCi. The RNA pellet was redissolved 
in water and extracted with phenol/chloroform. Sufficient 3M potassium acetate (pH 5) 
ws added to make the mixture 0.3M in acetate, followed by addition of two volumes of 
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ethanol to precipitate the RNA. After washing with ethanol, this final RNA precipitate 
was dissolved in water and stored frozen. 

Alternatively, total RNA may be obtained using TRIzol reagent (BRL- 
Lifetechnologies, Gaithersburg, MD) following the manufacturer's protocol. The RNA 
5 precipitate was dissolved in water and stored frozen. 

Example 2: Identification of LCAT Sequences 

Searches were performed on a Silicon Graphics Unix computer using additional 
Bioaccellerator hardware and Gen Web software supplied by Compugen Ltd. This 

1 0 software and hardware enabled the use of the Smith-Waterman algorithm in searching 
DNA and protein databases using profiles as queries. The program used to query protein 
databases was profilesearch. This is a search where the query is not a single sequence but 
a profile based on a multiple alignment of amino acid or nucleic acid sequences. The 
profile was used to query a sequence data set, i.e., a sequence database. The profile 

15 contained all the pertinent information for scoring each position in a sequence, in effect 
replacing the "scoring matrix" used for the standard query searches. The program used to 
query nucleotide databases with a protein profile was tprofilesearch, Tprofilesearch 
searches nucleic acid databases using an amino acid profile query. As the search is 
running, sequences in the database are translated to amino acid sequences in six reading 

2 0 frames. The output file for tprofilesearch is identical to the output file for profilesearch 
except for an additional column that indicates the frame in which the best alignment 
occurred. 

The Smith-Waterman algorithm, (Smith and Waterman (1981) J. Molec. Biol 
147:195-197), was used to search for similarities between one sequence from the query 

2 5 and a group of sequences contained in the database. 

A protein sequence of Lecithin: cholesterol acyltransferase from human (McLean 
J, et ai (1986) Nucleic Acids Res. 14(23):9397-406 SEQ ID NO:l)) was used to search the 
NCBI non-redundant protein database using BLAST. Three sequences were identified 
from Arabidopsis, GenBank accessions AC004557 ( referred to herein as LCAT1, SEQ ID 

3 0 NO:2), AC003027 (referred to herein as LCAT2, SEQ ED NO:4), and AL024486 (referred 

to herein as LCAT3, SEQ ID NO:6). The deduced amino acid sequences are provided in 
SEQ ID NOs: 3, 5, and 7, respectively. 

The profile generated from the queries using PSI-BLAST was excised from the 
hyper text markup language (html) file. The worldwide web (www)/html interface to 
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psiblast at ncbi stores the current generated profile matrix in a hidden field in the html file 
that is returned after each iteration of psiblast However, this matrix has been encoded into 
string62 (s62) format for ease of transport through html. String62 format is a simple 
conversion of the values of the matrix into html legal ascii characters, 
5 The encoded matrix width (x axis) is 26 characters, and comprise the consensus 

characters, the probabilities of each amino acid in the order A3,C,D,E,F,G,H,I,K,L,M,N, 
P,Q,R,S,T,V,W,X,Y,Z (where B represents D and N, and Z represents Q and E, and X 
represents any amino acid), gap creation value, and gap extension value. 

The length (y axis) of the matrix corresponds to the length of the sequences 
1 0 identified by PSI-BLAST. The order of the amino acids corresponds to the conserved 
amino acid sequence of the sequences identified using PSI-BLAST, with the N-tenninal 
end at the top of the matrix. The probabilities of other amino acids at that position are 
represented for each amino acid along the x axis, below the respective single letter amino 
acid abbreviation. 

1 5 Thus, each row of the profile consists of the highest scoring (consensus) amino 

acid, followed by the scores for each possible amino acid at that position in sequence 
matrix, the score for opening a gap that that position, and the score for continuing a gap at 
that position. 

The string62 file is converted back into a profile for use in subsequent searches, 
2 0 The gap open field is set to 1 1 and the gap extension field is set to 1 along the x axis. The 
gap creation and gap extension values are known, based on the settings given to the PSI- 
BLAST algorithm. The matrix is exported to the standard GCG profile form. This format 
can be read by Got Web. 

The algorithm used to convert the string62 formatted file to the matrix is outlined 
25 in Table 1. 
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Table 1 

1 . if encoded character z then the value is blast score min 

2. if encoded character Z then the value is blast score max 

3. else if the encoded character is uppercase then its value is (64-(ascii # of char)) 

4. else if the encoded character is a digit the value is ((ascii # of char)-48) 

5. else if the encoded character is not uppercase then the value is ((ascii # of char) - 87) 

6. ALL B positions are set to min of D and N amino acids at that row in sequence matrix 

7. ALL Z positions are set to min of Q amd E amino acids at that row in sequence matrix 

8. ALL X positions are set to min of all amino acids at that row in sequence matrix 

9. kBLAST_SCOREJV[AX=999; 

10. kBLAST_SCOREJvlIN==-999; 

1 1 . all gap opens are set to 1 1 

12. all gap lens are set to 1 


The protein sequences of LCAT1, LCAT2, and LCAT3 as well as the PSI-BLAST 
profile were used to search public and proprietary databases for additional LCAT 
sequences. Two EST sequences were identified which appear to be identical to LCAT1 
and LCAT3, respectively. One additional Arabidopsis sequence was identified from the 

2 0 proprietary databases, LCAT4 (SEQ ID NO:8). The deduced protein sequence of LCAT4 

is provided in SEQ ID NO:9. Two additional genomic sequences were identified using the 
PSI-BLAST profile from libraries of Arabidopsis ecotypes Columbia and Landsberg, 
LCAT7 (SEQ ID NO: 10) and LCAT8 (SEQ ID NO:l 1). The LCAT7 sequence was 
present in both the Columbia and Landsberg genomic libraries, while the LCAT8 sequence 
25 was only present in the Columbia library. 

An open reading frame was predicted from the genomic sequence of LCAT7 in the 
Arabidopsis public database and this sequence was called MSH12 (referred to herein as 
LCAT5, SEQ ID NO: 73). The deduced protein sequence of LCAT5 is provided in SEQ 
ID NO: 74. 

3 0 The PSI-BLAST profile and the LCAT sequences were used to query the public 

yeast database and proprietary libraries containing corn and soy EST sequences. The yeast 
genome contains only one gene, LROl (LCAT Related Open reading frame, YNR008 W, 
Figure 1) with distinct similarity to the human LCAT. The DNA sequence of LROl is 
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provided in SEQ ID NO: 75 and the protein sequence is provided in SEQ ID NO: 76. 
Seven EST sequences were identified from soybean libraries as being LCAT sequences. 
Two sequences from soy (SEQ ID NOs: 12 and 13) are most closely related to the 
Arabidopsis LCAT1 sequence, a single sequence was identified as being most closely 
5 related to LCAT2 (SEQ ID NO: 14) , three were closely related to LCAT3 (SEQ ID NOs: 
15-17), and an additional single sequence was identified (SEQ ID NO:18). A total of 11 
com EST sequences were identified as being related to the Arabidopsis LCAT sequences. 
Two corn EST sequences (SEQ ID NOs: 19 and 20) were most closely related to LCAT1, 
two sequences were identified as closely related to LCAT2 (SEQ ID NOs: 21 and 22), four 
1 0 corn EST sequences were identified as closely related to LCAT3 (SEQ ID NOs: 23-26), 
and an additional three corn EST sequences were also identified (SEQ ID NOs: 27-29). 

Example 3: Identification of ACAT Sequences 

Since plant ACATs are unknown in the art, searches were performed to identify 
15 known and related ACAT sequences from mammalian sources from public databases. 

These sequences were then used to search public and proprietary EST databases to identify 
plant ACAT-like sequences. 

A public database containing mouse Expressed Sequence Tag (EST) sequences 
(dBEST) was searched for ACAT-like sequences. The search identified two sequences 
2 0 (SEQ ID 30 and 3 1 ) which were related (approximately 20% identical), but divergent, to 
known ACAT sequences. 

In order to identify ACAT-like sequences from other organisms, the two mouse 
ACAT sequences were used to search public and proprietary databases containing EST 
sequences from human and rat tissues. Results of the search identified several sequences 

2 5 from the human database and from the rat database which were closely related to the 

mouse sequences. The human and rat ACAT-like EST sequences were assembled, using 
the GCG assembly program, to construct a complete inferred cDNA sequence by 
identifying overlapping sequences (SEQ ID NOs: 32 and 33, respectively). 

The protein sequence of the human ACAT-like sequence was aligned with known 

3 0 ACAT sequences from human (Chang, et al (1993) /. Biol Chem. 268:20747-20755, 

SEQ ID NO:34), mouse (Uelmen, etal. (1995)7. Biol Chem. 270:26192-26201 SEQ ID 
NO:35) and yeast (Yu, et al. (1996) J. Biol Chem. 271:24157-24163, SEQ ID NO:36 and 
Yang, et al (1996) Science 272:1353-1356, SEQ ID NO:37) using MacVector (Oxford 
Molecular, Inc.). Results of the alignment demonstrated that the sequence was related to 
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the known sequences, however the related sequence was only about 25% similar to the 
known sequences. 

The protein sequence of the human sterol O-acyltransferase (ACAT, Acyl 
CoA:Cholesterol acyltransferase, Accession number A48026) related sequence was used 
5 to search protein and nucleic acid Genbank databases. A single plant homologue was 
identified in the public Arabidopsis EST database (Accession A042298, SEQ ID NO:38). 
The protein sequence (SEQ ID NO:39)was translated from the EST sequence, and was 
found to contain a peptide sequence conserved in both mammalian and yeast ACATs 
(Chang et al., (1997) Ann. Rev. Biochem., 66:613-638). 
10 To obtain the entire coding region corresponding to the Arabidopsis AC AT-like 

EST, synthetic oligonucleotide primers were designed to amplify the 5' and 3* ends of 
partial cDNA clones containing ACAT-like sequences. Primers were designed according 
to the Arabidopsis ACAT-like EST sequence and were used in Rapid Amplification of 
cDNA Ends (RACE) reactions (Frohman et aL (1988) Proa Natl Acad. Set USA 
15 85:8998-9002), 

Primers were designed (5 , -TGCAAATTGACGAGCACACCAACCCCTTC-3 , 
(SEQ ID NO:40) and 5 AAGGATGC1TTGAGTTCCTGACAATAGG-3 * (SEQ ID 
NO:41)) to amplify the 5* end from the Arabidopsis ACAT EST sequence. Amplification 
of flanking sequences from cDNA clones were performed using the Marathon cDNA 
2 0 Amplification kit (Clontech, CA). 

The sequence derived from the 5' -RACE amplification was used to search 
proprietary Arabidopsis EST libraries. A single EST accession, LIB25-088-C7 (SEQ ID 
NO:42), was identified which contained a sequence identical to the S'-RACE sequence. 
Furthermore, LDB25-088-C7 was found to contain the complete putative coding sequence 

2 5 for the Arabidopsis ACAT-like product. 

The nucleic acid as well as the putative translation product sequences of A042298 
were used to search public and proprietary databases. Four EST sequences were identified 
in both soybean (SEQ ID NOs:43-46) and maize (SEQ ID NOs;47-50) proprietary 
databases, and a single ACAT-like sequence was identified from Mortierrella alpina EST 

3 0 sequences (SEQ ID NO:5 1 ). 

Sequence alignments between ACAT sequences from several different sources 
were compared to identify the similarity between the sequences. Nucleotide sequences 
from known human and mouse ACATs, as well as nucleotide sequences from known yeast 
ACATs were compared to the ACAT-like EST sequences from human and Arabidopsis. 
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Analysis of the sequence alignments revealed several classes of ACATs based on 
sequence similarity. The known human and mouse ACATs, being 88% similar in the 
nucleotide sequence, formed one class of ACATs. Another class of ACATs included the 
yeast ACATs which are less than 20% similar to the known human and mouse class 
5 ACATs. 

The final class of ACATs included the Arabidopsis and human sequences disclosed 
in the present invention. This class is approximately 22% similar to the known human and 
mouse AC AT class and approximately 23% similar to the yeast class of ACATs. Thus, 
the ACAT sequences disclosed in the present invention represent a novel class of ACAT 
1 0 enzymes. Partial mouse sequences of this class are also provided. 

Example 4: Expression Construct Preparation 

Constructs were prepared to direct expression of the LCAT1, LCAT2, LCAT3, 
LCAT4, LCAT5 and the yeast LROl sequences in plants and cultured insect cells. The 
1 5 entire coding region of each LCAT was amplified from the appropriate EST clone or an 
Arabidopsis genomic cDNA library using the following oligonucleotide primers in a 
polymerase chain reactions (PGR). The LCAT1 coding sequence was amplified from the 
EST clone Lib25-082-Ql-El-G4 using the primers 

5^GGATCCGCGGCCGCACAATGAAAAAAATATCITCACATTATTCGG-3' (SEQ 
20 ID NO:52) and 5 *-GG ATCCCCTGCAGGTC ATTC ATTGACGGC ATTAAC ATTGG-3 * 
(SEQ ED NO:53). The LCAT2 coding sequence was amplified from an Arabidopsis 
genomic cDNA library using the synthetic oligo nucleotide primers 
S'-GGATCCGCGGCCGCACAATGGGAGCGAATTCGAAATCAGTAACG-S' (SEQ 
ID NO:54) and 5 '-GGATCCCCTGCAGGTTAATACCCACT ' 

2 5 (SEQ ID NO:55). The LCAT3 coding sequence was amplified from the EST clone 

LIB22-004-Q1-E1-B4 using the synthetic oligo nucleotide primers 
5 *-GGATCCGCGGCCGC AC AATGTCTCTATTACTGGAA GAGATC-3* (SEQ ID 
NO:56) and S'-GGATCCCCTGCAGGTTATGCATC AACAGAGAC ACTTACAGC-3 * 
(SEQ ID NO:57) . The LCAT4 coding sequence was amplified from the EST clone 

3 0 LIB23-007-Q1 -El -B5 using the synthetic oligo nucleotide primers 

5 *-GG ATCCGCGGCCGC AC AATGGGCTGGATTCCGTGTCCGTGC-3 * (SEQ ID 
NO:58) and S'^ATCCCCTGCAGGTTAACCAGAATCAACtACTTTGTG-S' (SEQ 
ID NO:59). The LCAT5 coding sequence was amplified from LEB23-053-Q1-E1-E3 using 
the synthetic oligo nucleotide primers 
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5'-GGATCCGCGGCCGCACAATGCCCCTTATTCATCGG-3 , (SEQ ID NO:77) and 5*- ' 
GGATCCCCTGC AGGTCAC AGCTTC AGGTC AATACG-3 * (SEQ ID NO:78). 

The yeast LROI coding sequence was amplified from genomic yeast DNA using 
the synthetic oligo nucleotide primers 
5 5 'GGATCCGCGGCCGCACAATGGGCAC ACTGTTTCGAAG3 ' (SEQ ID NO:79) 
and 5 ' GGATCCCCTGC AGGTTACATTGGGAAGGGCATCTGAG3 ' (SEQ ID NO:80). 

The entire coding region of the Arabidopsis ACAT sequence (SEQ ID NO: 42) 
was amplified from the EST clone LIB25-088-C7 using oligonucleotide primers 
5 '-TCGACCTGCAGGAAGCTTAGAAATGGCGATTTTGGATTC-S * (SEQ ID NO: 60) 
10 and 5 '-GGATCCGCGGCCGCTCATGACATCGATCCnTTTCGG-3 * (SEQ ID NO: 61) 
in a polymerase chain reaction (PCR). 

Each resulting PCR product was subcloned into pCR2. lTopo (Invitrogen) and 
labeled pCGN9964 (LCAT1), pCGN9985 (LCAT2), pCGN9965 (LCAT3), pCGN9995 
(LCAT4), pCGN10964 (LCAT5), pCGN10963 (LR01) y and pCGN8626 (ACAT). 
1 5 Double stranded DNA sequence was obtained to verify that no errors were introduced by 
the PCR amplification. 

4A. Baculovirus Expression Constructs 

Constructs are prepared to direct the expression of the Arabidopsis LCAT and 

2 0 yeast LCAT sequences in cultured insect cells. The entire coding region of the LCAT 

proteins was removed from the respective constructs by digestion with Notl and &e8387I, 
followed by gel electrophoresis and gel purification. The fragments containing the LCAT 
coding sequences were cloned into Notl and Pstl digested baculovirus expression vector 
pFastBacl (Gibco-BRL, Gaithersburg, MD). The resulting baculovirus expression 
25 constructs were referred to as pCGN9992 (LCAT1), pCGN9993 (LCAT2), pCGN9994 
(LCAT3), pCGN10900 (LCAT4), pCGN10967 (LCAT5), and pCGN10962 {LROI). 

4B. Plant Expression Construct Preparation 

A plasmid containing the napin cassette derived from pCGN3223 (described in 

3 0 U.S. Patent No. 5,639,790, the entirety of which is incorporated herein by reference) was 

modified to make it more useful for cloning large DNA fragments containing multiple 
restriction sites, and to allow the cloning of multiple napin fusion genes into plant binary 
transformation vectors. An adapter comprised of the self annealed oligonucleotide of 
sequence 5'- 
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CGCGATTTAAATGGCGCGCCCTGCAGGCGGCCGCCTGCAGGGCGCGCCATTTA 
AAT-3' (SEQ ID NO:62) was ligated into the cloning vector pBC SK+ (Stratagene) after 
digestion with the restriction endonuclease BssHII to construct vector pCGN7765. 
Plamids pCGN3223 and pCGN7765 were digested with NotI and ligated together. The 
5 resultant vector, pCGN7770, contained the pCGN7765 backbone with the napin seed 
specific expression cassette from pCGN3223. 

The cloning cassette, pCGN7787, contained essentially the same regulatory 
elements as pCGN7770, with the exception of the napin regulatory regions of pCGN7770 
have been replaced with the double CAMV 35S promoter and the tml polyadenylation and 

1 0 transcriptional termination region. 

A binary vector for plant transformation, pCGN5 1 39, was constructed from 
pCGNl 558 (McBride and Summerfelt, (1 990) Plant Molecular Biology, 1 4:269-276). In 
pCGN5139, the polylinker of pCGN1558 was replaced as a HindIH/Asp718 fragment with 
a polylinker containing unique restriction endonuclease sites, AscI, Pad, Xbal, Swal, 

1 5 BamHI,and NotI. The Asp718 and Hindm restriction endonuclease sites are retained in 
pCGN5139. 

A series of turbo binary vectors was constructed to allow for the rapid cloning of 
DNA sequences into binary vectors containing transcriptional initiation regions 
(promoters) and transcriptional termination regions. 

2 0 The plasmid pCGN861 8 was constructed by ligating oligonucleotides 

5 '-TCG AGGATCCGCGGCCGCAAGCTTCCTGCAGG-3 * (SEQIDNO:63) and 
5 '-TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-S * (SEQ ID NO:64) into 
Sall/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker and 
napin 3* region was excised from pCGN8618 by digestion with Asp718I; the fragment 

2 5 was blunt-ended by filling in the 5* overhangs with Klenow fragment then ligated into 

pCGN5139 that had been digested with Asp718I and Hindm and blunt-ended by filling in 
the 5* overhangs with Klenow fragment. A plasmid containing the insert oriented so that 
the napin promoter was closest to the blunted Asp718I site of pCGN5139 and the napin 3' 
was closest to the blunted Hindin site was subjected to sequence analysis to confirm both 

30 the insert orientation and the integrity of cloning junctions. The resulting plasmid was 
designated pCGN8622. 

The plasmid pCGN8619 was constructed by ligating oligonucleotides 
5 * -TCG ACCTGC AGGAAGCTTGCGGCCGCGG ATCC-3 * (SEQ ID NO:65) and 
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5*-TCGAGGATCCGCGGCCGC AAGCTTCCTGCAGG-3 ' (SEQ ID NO:66) into 
Sall/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker and 
napin 3' region was removed from pCGN8619 by digestion with Asp718I; the fragment 
was blunt-ended by filling in the 5* overhangs with Klenow fragment then ligated into 
5 pCGN5 139 that had been digested with Asp71 81 and Hindlll and blunt-ended by filling in 
the 5* overhangs with Klenow fragment. A plasmid containing the insert oriented so that 
the napin promoter was closest to the blunted Asp718I site of pCGN5139 and the napin 3* 
was closest to the blunted Hindm site was subjected to sequence analysis to confirm both 
the insert orientation and the integrity of cloning junctions. The resulting plasmid was 

10 designated pCGN8623. 

The plasmid pCGN8620 was constructed by Ifgating oligonucleotides 
5'-TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGGAGCT -3' (SEQ ID NO:67) and 
5^CCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:68) into Sall/SacI- 
digested pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' 

15 region was removed from pCGN8620 by complete digestion with Asp71 81 and partial 
digestion with NotL The fragment was blunt-ended by filling in the 5' overhangs with 
Klenow fragment then ligated into pCGN5139 that had been digested with Asp71 81 and 
HindHI and blunt-ended by filling in the 5' overhangs with Klenow fragment A plasmid 
containing the insert oriented so that the d35S promoter was closest to the blunted 

20 Asp718I site of pCGN5139 and the tml 3* was closest to the blunted Hindffl site was 
subjected to sequence analysis to confirm both the insert orientation and the integrity of 
cloning junctions. The resulting plasmid was designated pCGN8624. 

The plasmid pCGN8621 was constructed by ligating oligonucleotides 
5 '-TCGACCTGCAGGAAGCTTGCGGCCGCGGATCCAGCT -3* (SEQ ID NO:69) and 

2 5 5 *-GG ATCCGCGGCCGC AAGCTTCCTGC AGG-3 * (SEQ ID NO:70) into Sall/Sacl- 

digested pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' 
region was removed from pCGN8621 by complete digestion with Asp71 8Iand partial 
digestion with NotL The fragment was blunt-ended by filling in the 5' overhangs with 
Klenow fragment then ligated into pCGN5139 that had been digested with Asp718I and 

3 0 Hindlll and blunt-ended by filling in the 5 * overhangs with Klenow fragment. A plasmid 

containing the insert oriented so that the d35S promoter was closest to the blunted 
Asp71 81 site of pCGN5 139 and the tml 3' was closest to the blunted Hindffl site was 
subjected to sequence analysis to confirm both the insert orientation and the integrity of 
cloning junctions. The resulting plasmid was designated pCGN8625. 
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The plasmid construct pCGN8640 is a modification of pCGN8624 described 
above, A 938bp PstI fragment isolated from transposon Tn7 which encodes bacterial 
spectinomycin and streptomycin resistance (Fling et al. (1985), Nucleic Acids Research 
13(19):7095-7106), a determinant for E. coli and Agrobacterium selection, was blunt 
ended with Pfu polymerase. The blunt ended fragment was ligated into pCGN8624 that 
had been digested with Spel and blunt ended with Pfii polymerase. The region containing 
the PstI fragment was sequenced to confirm both the insert orientation and the integrity of 
cloning junctions. 

The spectinomycin resistance marker was introduced into pCGN8622 and 
pCGN8623 as follows. A 7.7 Kbp Avrll-SnaBI fragment from pCGN8640 was ligated to 
a 10.9 Kbp Avrll-SnaBI fragment from pCGN8623 or pCGN8622, described above. The 
resulting plasmids were pCGN8641 and pCGN8643, respectively. 

The plasmid pCGN8644 was constructed by ligating oligonucleotides 
5 '-GATC ACCTGCAGGAAGCTTGCGGCCGCGGATCCAATGC A-3 ' (SEQ ED NO:71) 
and 5 '-TTGGATCCGCGGCCGCAAGCTTCCTGCAGGT-3 , (SEQ ID NO:72) into 
BamHI-PstI digested pCGN8640. 

4C. Plant LCAT Expression Construct Preparation 

The coding sequence of LCAT1 was cloned from pCGN9964 as a NotV &e8387I 
2 0 fragment into pCGN8640, pCGN8641 , pCGN8643, and pCGN8644 to create the 

expression constructs pCGN9960, pCGN9961, pCGN9962, and pCGN9963, respectively. 
The construct pCGN9960 was designed to express the LCAT1 coding sequence in the 
sense orientation from the constitutive promoter CaMV 35S. The construct pCGN9961 
was designed to express the LCAT1 coding sequence in the antisense orientation from the 

2 5 napin promoter. The construct pCGN9962 was designed to express the LCAT1 coding 

sequence in the sense orientation from the napin promoter. The construct pCGN9963 was 
designed to express the LCAT1 coding sequence in the antisense orientation from the 
constitutive promoter CaMV 35S. 

The coding sequence of LCAT2 was cloned from pCGN9985 as a NotV &e8387I 

3 0 fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the 

expression constructs pCGN9981, pCGN9982, pCGN9983, and pCGN9984, respectively. 
The construct pCGN998 1 was designed to express the LCAT2 coding sequence in the 
sense orientation from the constitutive promoter CaMV 35S. The construct pCGN9982 
was designed to express the LCAT2 coding sequence in the antisense orientation from the 


10 
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napin promoter. The construct pCGN9983 was designed to express the LCAT2 coding 
sequence in the sense orientation from the napin promoter. The construct pCGN9984 was 
designed to express the LCAT2 coding sequence in the antisense orientation from the 
constitutive promoter CaMV 35S. 
5 The coding sequence of LCAT3 was cloned from pCGN9965 as a NotV &<?8387I 

fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the 
expression constructs pCGN9966, pCGN9967, pCGN9968, and pCGN9969, respectively. 
The construct pCGN9966 was designed to express the LCAT3 coding sequence in the 
sense orientation from the constitutive promoter CaMV 35S. The construct pCGN9967 

1 0 was designed to express the LCAT3 coding sequence in the antisense orientation from the 
napin promoter. The construct pCGN9968 was designed to express the LCAT3 coding 
sequence in the sense orientation from the napin promoter. The construct pCGN9969 was 
designed to express the LCAT3 coding sequence in the antisense orientation from the 
constitutive promoter CaMV 35S. 

15 The coding sequence of LCAT4 was cloned from pCGN9995 as a NotV &e8387I 

fragment into pCGN8640, pCGN864i, pCGN8643, and pCGN8644 to create the 
expression constructs pCGN9996, pCGN9997, pCGN9998, and pCGN9999, respectively. 
The construct pCGN9996 was designed to express the LCAT4 coding sequence in the 
sense orientation from the constitutive promoter CaMV 35S. The construct pCGN9997 

2 0 was designed to express the LCAT4 coding sequence in the antisense orientation from the 
napin promoter. The construct pCGN9998 was designed to express the LCAT4 coding 
sequence in the sense orientation from the napin promoter. The construct pCGN9999 was 
designed to express the LCAT4 coding sequence in the antisense orientation from the 
constitutive promoter CaMV 35S. 

25 The coding sequence of LCAT5 was cloned from pCGN10964 as a NotV &e8387I 

fragment into pCGN9977 and pCGN9979, to create the expression constructs 
pCGN10965, and pCGN10966, respectively. The construct pCGN10965 was designed to 
express the LCAT5 coding sequence in the sense orientation from the constitutive 
promoter CaMV 35S. The construct pCGN10966 was designed to express the LCAT5 

30 coding sequence in the sense orientation from the napin promoter. 

The coding sequence of LROl was cloned from pCGN10963 as a NotV &e8387I 
fragment into pCGN9977 and pCGN9979 f to create the expression constructs 
pCGN10960, and pCGN10961 , respectively. The construct pCGN10960 was designed to 
express the LROl coding sequence in the sense orientation from the constitutive promoter 
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CaMV 35S. The construct pCGN10961 was designed to express the LROl coding 
sequence in the sense orientation from the napin promoter. 

4D. Plant ACAT Expression Construct Preparation 
5 A fragment containing the Arabidopsis ACAT-like coding region was removed 

from pCGN8626 by digestion with Sse8387I and Not I. The fragment containing the 
ACAT-like sequence was ligated into Pstl-Not I digested pCGN8622. The resulting 
plasmid was designated pCGN8627. DNA sequence analysis confirmed the integrity of 
the cloning junctions. 

10 A fragment containing the Arabidopsis ACAT-like coding region (SEQ ID NO: 

42) was removed from pCGN8626 by digestion with Sse8387I and Not I. The fragment 
was ligated into Pstl-Not I digested pCGN8623. The resulting plasmid was designated 
pCGN8628. DNA sequence analysis confirmed the integrity of the cloning junctions. 

A fragment containing the Arabidopsis ACAT-like coding region was removed 
1 5 from pCGN8626 by digestion with Sse8387 and Not I. The fragment was ligated into 
Pstl-Not I digested pCGN8624. The resulting plasmid was designated pCGN8629. DNA 
sequence analysis confirmed the integrity of the cloning junctions. 

A fragment containing the Arabidopsis ACAT-like coding region was removed 
from pCGN8626 by digestion with Sse8387 and Not I. The fragment was ligated into 
2 0 Pstl-Not I digested pCGN8625. The resulting plasmid was designated pCGN8630. DNA 
sequence analysis confirmed the integrity of the cloning junctions. 

An additional expression construct for the suppression of endogenous ACAT-like 
activity was also prepared. The construct pCGN8660 was constructed by cloning 
approximately 1 Kb of the Arabidopsis ACAT-like coding region from pCGN8626 in the 

2 5 sense orientation, and the full-length Arabidopsis ACAT-like coding region in the 

antisense orientation under the regulatory control of the napin transcription initiation 
sequence. 

For expression of the rat ACAT-like sequence in plants, the NotI-Sse8387I 
fragment of pCGN8592 was cloned into Notl-PstI digested binary vectors pCGN8621, 

3 0 pCGN8622, and pCGN8624 to yield plasmids, pCGN 9700, pCGN9701, and pCGN9702, 

respectively. Plasmid pCGN9700 expresses a sense transcript of the rat ACAT-like cDNA 
under control of a napin promoter, plasmid pCGN9701 expresses an antisense transcript of 
the rat ACAT-like cDNA under control of a napin promoter, and plasmid pCGN9702 
expresses a sense transcript of the rat ACAT-like cDNA under control of a double 35S 
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promoter. Plasmids pCGN 9700, pCGN9701, and pCGN9702 were introduced in 
Agrobacterium tumefaciens EHA101 . 

Constructs were prepared to direct the expression of the rat ACAT-like sequence in 
the seed embryo of soybean and the endosperm of corn. For expression of the rat ACAT- 
5 like DNA sequence in soybean, a 1 .5 fcb Notl/SseZ3$7I fragment from pCGN8592 
containing the coding sequence of the rat ACAT-like sequence was blunt ended using 
Mung bean nuclease, and ligated into the Smal site of the tuibo 7S binary/cloning vector 
pCGN8809 to create the vector pCGN8817 for transformation into soybean by particle 
bombardment. The vector pCGN88 17 contained the operably linked components of the 

1 0 promoter region of the soybean a* subunit of p-conglycinin (7S promoter, (Chen et al. t 
(1986), Proa Natl Acad. ScL f 83:8560-8564), the DNA sequence coding for the entire rat 
ACAT-like protein, and the transcriptional termination region of pea RuBisCo small 
subunit, referred to as E9 3* (Coruzzi, et al (1984) EMBOJ. 3:1671-1679 and Morelli, et 
al (1985) Nature 315:200-204). This construct further contained sequences for the 

1 5 selection of positive transformed plants by screening for resistance to glyphosate using the 
CP4 EPSPS (U.S. Patent 5,633,435) expressed under the control of the figwort mosaic 
virus (FMV) promoter (U.S. Patent Number 5,378,619) and the transcriptional termination 
region of E9. 

For expression of the rat ACAT-like sequence in the corn endosperm, a 1 .5 kb 
2 0 NotVSse83S7l fragment from pCGN8592 containing the coding sequence of the rat 
ACAT-like sequence was blunt ended using Mung bean nuclease, and ligated into the 
BamHl site of the rice pGtt expression cassette pCGN8592 for expression from the pGtl 
promoter (Leisy, D.J. et al., Plant Mol. Biol. 14 (1989) 41-50) and the HSP70 intron 
sequence (U.S. Patent Number 5,593,874). This cassette also included the transcriptional 

2 5 termination region downstream of the cloning site of nopaline synthase, nos 3* (Depicker 

et al., J. Molea Appl Genet (1982) 1 : 562-573). A 7.5 kb fragment containing the pGtl 
promoter, the DNA sequence encoding the rat ACAT-like protein, and the nos 
transcriptional termination sequence was cloned into the binary vector pCGN8816 to 
create the vector pCGN88 1 8 for transformation into corn. This construct also contained 

3 0 sequences for the selection of positive transformants with kanamycin using the kanamycin 

resistance gene from Tn5 bacteria under the control of the CAMV 35S promoter and tail 
transcriptional termination regions. 
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Example 5: Expression in Insect Cell Culture 

A baculovirus expression system was used to express the LCAT cDNAs in 
cultured insect cells. 

The baculovirus expression constructs pCGN9992, pCGN9993, pCGN9994, 
5 pCGN10900, pCGN10962, and pCGN10967 were transformed and expressed using the 
BAC-to-B AC Baculovirus Expression System (Gibco-BRL, Gaithersburg, MD) according 
to the manufacturer's directions. 

The transformed insect cells were used to assay for acyltransferase activities using 
methods known in the art (see Example 8). 

10 

Example 6: Plant Transformation 

A variety of methods have been developed to insert a DNA sequence of interest into 
the genome of a plant host to obtain the transcription or transcription and translation of the 
sequence to effect phenotypic changes. Transgenic plants were obtained by Agrobacterium- 
1 5 mediated transformation as described by Radke et al (Theor. Appl Genet. (1 988) 75:685-694; 
Plant Cell Reports (1992) 77:499-505). Alternatively, microprojectile bombardment methods, 
such as described by Klein et al {Bio/Technology 70:286-291) may also be used to obtain 
nuclear transformed plants. Other plant species may be similarly transformed using related 
techniques. 

2 0 The plant binary constructs described above were used in plant transformation to direct 

the expression of the sterol acyltransferases in plant tissues. Binary vector constructs were 
transformed into strain EHA101 Agrobacterium cells (Hood et al, J. Bacteriol (1986) 
7^5:1291-1301), by the method of Holsters et al (Mol Gen. Genet (1978) 753:181-187). 
Transgenic Arabidopsis thaliana plants were obtained by Agrobacterium-mediBted 

2 5 transformation as described by Valverkens et al, (Proa Nat Acad Sci. (1 988) £5:5536-5540), 
Bent et ah ((1994), Science 265:1 856-1860), and Bechtold et al. ((1993), C R.Aca£ ScL. Life 
Sciences 316:1 194-1 199). 

Example 7; Plant Assays for Modified Sterol Content/Profile 
30 7A: NMR of T2 seed 

Seed from plants expressing LCAT 1 through 4 under the control of the napin 
promoter were analyzed by NMR. Arabidopsis seeds from transgenic plants were placed 
directly into wide-mouth MAS NMR sample tubes. 
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High-resolution spectra were measured at 1 1.7 T (1H=500 MHz, 130125 mHz) 
using Varian NMR Instruments (Palo Alto, CA) Inova™ NMR spectrometers equipped 
with carbon-observe MAS Nanoprobes™. The 1 3C spectra were acquired without a field- 
frequency lock at ambient temperature (approx. 21-22°C) for 14 hours using the following 
5 conditions: spectral width = 29.996 kHz, acquisition time « 2. 1 85 seconds, p/2 pulse (3.8 
ms) with no relaxation delay, 1H g B2 = 2.5 kHz with Waltz decoupling. Data processing 
conditions were typically: digital resolution = 0.1 1 Hz, 0.3 to 1.5 Hz line broadening and 
time-reversed linear prediction of the first three data points. Chemical shifts were 
referenced by adding neat tetramethylsilane (TMS) to Arabidopsis seeds and using the 

1 0 resulting referencing parameters for subsequent spectra. The 13C resolution was 2-3 Hz 
for the most narrow seed resonances. Spectral resolution was independent of MAS 
spinning speeds (1 .5-3.5 kHz) and data were typically obtained with 1 .5 kHz spinning 
speeds. Spinning sidebands were approx. 1% of the main resonance. Phytosterol 13C 
assignments were based on model samples composed of triolein, p-sitosterol and 

15 cholesterol oleate. Triacylglycerol 13C assignments were made from comparison with 
literature assignments or with shifts computed from a 13C NMR database (Advanced 
Chemical Development, Inc., version 3.50, Toronto Canada). 

The results of these analyses are displayed in Figure 2 and show that there was a 
trend of an approximately 2 fold increase of phytosterols in the seeds derived from plant 

2 0 line 5 expressing the LCAT 4 gene (pCGN9998) under the control of the napin promoter. 
During the course of this analysis it was also noted that the average oil content of seed 
from plants expressing the LCAT2 construct (pCGN9983) under the control of the napin 
promoter was higher than that of controls. This is the first in planta evidence supporting 
the concept that overexpression of a nucleotide sequence encoding a lecithin.cholesterol 

2 5 acyltransferase-like polypeptide can increase oil content. 

7B: HPLC/MS of T2 seed 

Seed oil from T2 plants expressing LCAT1 through 4 under the control of the 
napin promoter was extracted using an accelerated solvent extractor (ASE) method. Seed 

3 0 samples were ground, using a mortar and pestle, to achieve a fine homogeneous meaL Oil 

was obtained using a Dionex Accelerated Solvent Extractor (ASE). Clean ground seed 
was added to an equal amount of diatomaceous earth. The ground seed sample and the 
diatomaceous earth were thoroughly mixed until a homogeneous texture was achieved. 
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The sample was then loaded into the instrument and oil extraction was achieved using 
hexane under validated laboratory protocols. 

Oil Scorn these seed samples was then analyzed for sterol ester analysis using 
HPLC/MS for free campesterol, stigmasterol, and sitosterol and their fatty acid esters. To 
5 the autosampler vial containing approximately 0.1 grams oil was added 0.3 mLs CDC1 3 . 
One-hundred microliters of this solution was added to 900 microliters CHCI 3 . Five 
microliters of this diluted sample was subsequently injected into an HPLC/MS with 
positive ion atmospheric pressure ionization. The individual components in the oils were 
separated using two 4.6 x 50 mm C 8 Zorbax columns in series and a gradient using 

1 0 acetonitrile and acetonitrile with 40% CHC1 3 . The sterol concentrations were calculated 
assuming each sterol and its fatty acids have the same molar responses. This was observed 
to be the case with cholesterol and its esters and was assumed to be the case for 
campesterol, stigmasterol, and sitosterol. In the present study, the sterol identified as 
stigmasterol was actually an isomer of this compound. 

15 The results of these analyses are displayed in Figures 3 and 4 and show that there 

were sterol ester enhancements on the order of 50%. in the seeds derived from six out of 
seven T2 plant lines expressing LCAT3 (pCGN9968) under the control of the napin 
promoter. 

2 0 Example 8: Baculovirus Insect Cell Culture for Sterol Esterification Activity 

Baculovirus expression construct pCGN9992, pCGN9993, pCGN9994 and 
pCGN10900 (see Example 4) were transformed and expressed using the BAC-TOBAC 
Baculovirus Expression System (Gibco-BRL, Gaithersburg, MD) according to the 
manufacturer's instructions except harvesting of recombinant viruses was done 5 days 

25 post-tiansfection. The supernatant from the transfection mixture was used for generating 
virus stock which in turn was used for infecting Sf9 cells used in the assay. 

The transformed cells were assayed for lecithinrsterol acyitransferase activities 
using the method described herein. Insect cells were centrifiiged and the resulting cell 
pellet was either used immediately or stored at -70 C for later analysis. Cells were 

30 resuspended in Medium A (100 mM Tricine/NaOH, pH 7.8, 10% (w/v) glycerol, 280 mM 
NaCl with : 0.1 jiM Aprotinin, 1 ^M Leupeptin, and 100 pM Pefabloc (all from 
Boehringer Mannheim, Germany) and lysed by sonication (2x10 sec). Cell walls and 
other debris were pelleted by centrifiigation (14,000 x g , 10 min, 4°C). The supernatant 
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was transferred to a new vial and membranes pelleted by centrifogation (100,000 x g, Ti 
70.1 rotor, 46,000 rpm for 1 hour at 4°C). Total membranes were resuspended in Medium 
A. Lecithinrsterol acyltransferase activity was assayed in a 0.1 ml reaction mixture 
containing 100 mM Tris/HCl, pH 7, 28 mM NaCI, 0.03% Triton X-100, 0.1 mM sitosterol, 
5 20 jxM l,2-["C]-palmitoyl-phosphatidyl choline (246420 dpm/nmole), and 0.05-20 mg of 
membrane protein. After 15 minutes at 30 °C, the reaction was terminated by addition of a 
0.5 ml solution of methylene chloridermethanol (4:1, v/v ) containing 100 \xg cholesterol 
and cholesterol ester as cold carriers. A portion (0.1 ml) of the bottom organic layer was 
removed and evaporated under nitrogen gas. The concentrated extract was resuspended in 

10 30 nl of hexane and spotted onto a silica gel-G thin layer chromatographic plate. The 
plate was migrated in hexane:diethyl ethenacetic acid (80:20: 1) to the top, then air dried. 
Radioactivity was determined by exposure to a Low Energy Phosphor-imaging Screen. 
Following exposure, the screen was read on a phosphorimager. 

The LCAT 4 protein from pCGN10900 in baculovirus membranes showed a 

15 radioactive spot in the region of the TLC plate where cholesterol ester migrates indicating 
that LCAT 4 has the ability to catalyze the transfer of an acyl group from lecithin (PC) to 
sitosterol to make a sitosterol ester. 

Example 9: Plant Assay for Modified Lipid Content 
2 0 Nir (near infrared spectroscopy spectral scanning) can be used to determine the 

total oil content of Arabidopsis seeds in a non-destructive way provided that a spectral 
calibration curve has been developed and validated for seed oil content. A seed oil 
spectral calibartion curve was developed using seed samples from 85 Arabidopsis plants. 
Seed was cleaned and scanned using a Foss NIR model 6500 (Foss-Nirs Systems, Inc.). 

2 5 Approximately 50 to 100 milligrams of whole seeds, per sample, were packed in a mini 

sample ring cup with quartz lens [ IH-0307 ] consisting a mini-insert [ IH-0337 ] and 
scanned in reflectance mode to obtain the spectral data. The seed samples were then 
ground, using a mortar and pestle, to achieve a fine homogeneous meal. The ground 
samples were measured for oil using an accelerated solvent extractor (ASE). 

3 0 Measurement for the total oil content was performed on the Dionex Accelerated 

Solvent Extractor (ASE). Approximately 500 mg of clean ground seed was weighed to the 
nearest 0.1 mg onto a 9 x 9 cm weigh boat An equal amount of diatomaceous earth was 
added using a top-loading balance accurate to the nearest 0.01 g. The ground seed sample 
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and the diatomaceous earth were thoroughly mixed until a homogeneous texture was 
achieved. The sample was loaded on to the instrument and oil extraction was achieved 
using hexane under validated laboratory protocols. Standard Rapeseed samples were 
obtained from the Community Bureau of Reference (BCR). The ASE extraction method 
5 was validated using the BCR reference standards. A total percent oil recovery of 99% to 
100% was achieved. "As-is" oil content was calculated to the nearest 0.01 mass 
percentage using the formula: 

Oil Content = 100% x (vial plus extracted oil wt - initial vial wt) / (sample wt) 

10 

The analytical data generated by ASE were used to perform spectral calibrations. 
Nir calibration equations were generated using the built-in statistical package within the 
NirSytems winisi software. The spectral calibration portion of the software is capable of 
calibration and self-validation. From a total of 85 samples, 57 samples were used to 

1 5 generate the total percent oil calibration. The remaining samples were used to validate the 
oil calibrations. Optimized smoothing, derivative size, and mathematical treatment 
(modified partial least square) was utilized to generate the calibration. The samples that 
were not used in building respective calibrations- were used as a validation set. Statistical 
tools such as correlation coefficient ( R ), coefficient of determination (R 2 ), standard error 

2 0 of prediction ( SEP ), and the standard error of prediction corrected for bias (SEPC) were 
used to evaluate the calibration equations. 

T2 seeds from plants that had been transformed with the LCAT genes were cleaned 
and scanned using a Foss NIR model 6500 (Foss-Nirs Systems, Inc.). Approximately 50 to 
1 00 milligrams of whole seeds, per sample, were packed in a mini sample ring cup with 

2 5 quartz lens [ IH-0307 ] consisting a mini-insert [ IH-0337 ] and scanned in reflectance 

mode to obtain the spectral data. Oil percentage in each seed sample was determined 
using the seed oil spectral calibration curve detailed above. 

The results of these analyses are found in Figure 5 and Table 2 and show that there 
was a significant increase in the oil level in seed from T2 plants expressing the LCAT2 

3 0 gene. This increase in oil was seen in plants when LCAT2 was driven by either the 35S 

constitutive promoter or the seed-specific napin promoter. These results show that 

o verexpression of a nucleic acid sequence encoding a lecithinxholesterol acyltransferase- 

like polypeptide can increase seed oil production in plants. 
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Table 2 



Construct number 

Seed Oil Percentage (%) 

CONTROL 


24.7 

CONTROL 


28.0 

CONTROL 


31.8 

CONTROL 


32.4 

NAPINLCAT1 

PCGN9962 

28.5 

NAPIN LCAT1 

PCGN9962 

28.9 

NAPIN LCAT1 

PCGN9962 

29.6 

NAPIN LCAT1 

PCGN9962 

30.1 

NAPIN LCAT1 

PCGN9962 

30.1 

NAPIN LCAT1 

PCGN9962 

30.1 

NAPIN LCAT1 

PCGN9962 

30.8 

NAPIN LCAT1 

PCGN9962 

31.0 

NAPIN LCAT1 

pCGN9962 

32.1 

NAPIN LCAT1 

pCGN9962 

34.2 

NAPIN LCAT3 

pCGN9968 

26.8 

NAPIN LCAT3 

pCGN9968 

27.4 

NAPIN LCAT3 

pCGN9968 

29.0 

NAPIN LCAT3 

pCGN9968 

29.0 

NAPIN LCAT3 

pCGN9968 

32.6 

NAPIN LCAT2 

pCGN9983 

26.5 

NAPIN LCAT2 

pCGN9983 

34.7 

NAPIN LCAT2 

pCGN9983 

34.8 

NAPIN LCAT2 

pCGN9983 

35.7 

NAPIN LCAT2 

pCGN9983 

35.8 

NAPIN LCAT2 

pCGN9983 

36.3 

NAPIN LCAT2 

pCGN9983 

36.7 

NAPIN LCAT2 

pCGN9983 

37.0 

NAPIN LCAT2 

pCGN9983 

372 

NAPIN LCAT2 

pCGN9983 

37.3 

NAPIN LCAT2 

pCGN9983 

37.3 

NAPIN LCAT2 

pCGN9983 

37.4 

NAPIN LCAT2 

pCGN9983 

37.8 

NAPIN LCAT2 

pCGN9983 

38.0 

NAPIN LCAT2 

pCGN9983 

38.0 

35SLCAT2 

pCGN9981 

27.3 

35S LCAT2 

pCGN9981 

28.1 

35S LCAT2 

pCGN9981 

28.2 

35S LCAT2 

pCGN9981 

28.6 

35S LCAT2 

pCGN9981 

29.8 

35S LCAT2 

pCGN9981 

30.3 

35S LCAT2 

pCGN9981 

32.4 

35S LCAT2 

pCGN9981 

32.5 

35S LCAT2 

pCGN9981 

33.6 

35S LCAT2 

pCGN9981 

34.1 

35S LCAT2 

pCGN9981 

35.5 

35S LCAT2 

pCGN9981 

36.4 

35S LCAT2 

pCGN998l 

37.1 

35S LCAT2 

PCGN9981 

38.3 
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35S LCAT2 

pCGN9981 

38.5 

35S LCAT2 

PCGN9981 

39.1 


In light of the detailed description of the invention and the examples presented 
above, it can be appreciated that the several aspects of the invention are achieved. 
5 It is to be understood that the present invention has been described in detail by way 

of illustration and example in order to acquaint others skilled in the art with the invention, 
its principles, and its practical application. Particular formulations and processes of the 
present invention are not limited to the descriptions of the specific embodiments 
presented, but rather the descriptions and examples should be viewed in terms of the 

1 0 claims that follow and their equivalents. While some of the examples and descriptions 
above include some conclusions about the way the invention may function, the inventors 
do not intend to be bound by those conclusions and functions, but put them forth only as 
possible explanations. 

It is to be further understood that the specific embodiments of the present invention 

15 as set forth are not intended as being exhaustive or limiting of the invention, and that many 
alternatives, modifications, and variations will be apparent to those of ordinary skill in the 
art in light of the foregoing examples and detailed description. Accordingly, this invention 
is intended to embrace all such alternatives, modifications, and variations that fall within 
the spirit and scope of the following claims. 
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What is claimed is: 

1 . An isolated nucleic acid sequence comprising a polynucleotide encoding a plant 
lecithinxholesterol acyltransferase-like polypeptide or fragment thereof. 

2. The isolated nucleic acid sequence of claim 1, wherein said plant lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of Arabidopsis, 
soybean and corn. 

3. An isolated nucleic acid sequence comprising a polynucleotide encoding a plant acyl 
CoAxholesterol acyltransferase-like polypeptide. 

4. The isolated nucleic acid sequence of claim 3, wherein said polynucleotide is SEQ ID 
NO: 42 or degenerate variants thereof. 

5. The isolated nucleic acid sequence of claim 1, wherein said polynucleotide is selected 
from the group consisting of SEQ ID NO: 2, 4, 6, 8, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 
19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 43, 44, 45, 46, 47, 48, 49, 50, 51, 73 and 75 
or degenerate variants thereof. 

6. An isolated nucleic acid sequence consisting essentially of SEQ ID NO: 2, 4, 6, 8, 10, 
1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 42, 43, 44, 45, 
46, 47, 48, 49, 50, 51, 73 or 75. 

7. An isolated nucleic acid sequence consisting of SEQ ID NO: 2, 4, 6, 8, 10, 1 1, 12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 42, 43, 44, 45, 46, 47, 48, 
49, 50, 51, 73 or 75. 

8. An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO 3 or SEQ ID 
NO 3 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 2; 

c) an isolated polynucleotide that has at least 70% sequence identity to SEQ ID 
NO: 2; 
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d) an isolated polynucleotide that has at least 80% sequence identity to SEQ ID 
NO: 2; 

e) an isolated polynucleotide that has at least 90% sequence identity to SEQ ED 
NO: 2; 

f) an isolated polynucleotide that has at least 95% sequence identity to SEQ ID 
NO: 2; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 2; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 2 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-^VCR^^-Y 3\ where X is hydrogen, Y is hydrogen or a metal, 
R, and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 3 or SEQ 
ID NO: 3 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 2; 

c) an isolated polynucleotide that has at least 70% sequence identity to SEQ ID 
NO: 2; 

d) an isolated polynucleotide that has at least 80% sequence identity to SEQ DO 
NO: 2; 

e) an isolated polynucleotide that has at least 90% sequence identity to SEQ ID 
NO: 2; 

f) an isolated polynucleotide that has at least 95% sequence identity to SEQ ID 
NO: 2; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 2; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(<0,(e),(f)or(g);and 
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i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 2 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 


10. An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 5 or SEQ 
ID NO: 5 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 4; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 4; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 4; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 4; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 4; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 4; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 4 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

11. An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5* X-^VCR^-^^-Y 3', where X is hydrogen, Y is hydrogen or a metal, 
R, and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) ai isolated polynucleotide encoding a polypeptide of SEQ ID NO: 5 or SEQ 
ID NO: 5 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 4; 
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c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 4; 

10 d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 

NO: 4; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 4; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
15 NO: 4; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 4; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b)> (c), 
(d),(e),(f)or(g);and 

20 i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 

ID NO: 4 and encodes a plant lecithinxholesterol acyltransferase-Iike 
polypeptide. 

12. An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 7 or SEQ 
ID NO: 7 with at least one conservative amino acid substitution; 
5 b) SEQ ID NO: 6; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 6; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 6; 

10 e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 

NO: 6; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 6; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
1 5 stringent conditions to SEQ ID NO: 6; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 
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i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 6 and encodes a plant lecithinrcholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5* X-(R l ) a -(R 2 ) 0 -(R 3 ) n -Y 3', where X is hydrogen, Y is hydrogen or a metal, 
R, and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 7 or SEQ 
ID NO: 7 with at least one conservative amino acid substitution; 

b) SEQ ED NO: 6; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 6; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO:6; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 6; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 6; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 6; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 6 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 9 or SEQ ID 
NO: 9 with at least one conservative amino acid substitution; 

b) SEQ ID NO 8; 
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c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 8; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 8; 

10 e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 

NO: 8; 

0 an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 8; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
1 5 stringent conditions to SEQ ID NO: 8; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), (d), 
, (e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ ID 
NO: 8 and encodes a plant lecithimcholesterol acyltransferase-like polypeptide. 

15. An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X^X^^R^Y 3\ where X is hydrogen, Y is hydrogen or a metal, 
R, and R 3 are any nucleic acid, n is an integer between 0-3000, and It^ is selected from 
the group consisting of: 

5 a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 9 or SEQ 

ID NO: 9 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 8; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 8; 

10 d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 

NO: 8; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 8; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
15 NO: 8; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 8; 
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h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 8 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

k An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 74 or SEQ 
ID NO: 74 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 73; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 73; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 73; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 73; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 73; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ED NO: 73; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 73 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5* X-CRJ^HR^y 3', where X is hydrogen, Y is hydrogen or a metal, 
R, and R 3 are any nucleic acid, n is an integer between 0-3000, and is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 74 or SEQ 
ID NO: 74 with at least one conservative amino acid substitution; 
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b) SEQIDNO:73; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 73; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 73; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 73; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 73; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 73; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f) or(g); and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 73 and encodes a plant lecithinxholesterol acyltnmsferase-like 
polypeptide. 

1 8. A isolated nucleic acid sequence comprising a polynucleotide selected from the group 
consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 76 or SEQ 
ID NO: 76 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 75; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 75; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 75; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 75; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 75; - 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 75; 
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h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e), (f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO; 75 and encodes a plant lecithinrcholesterol acyltransferase-like 

20 polypeptide. 

i 9. An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-fRJ^R^-^X-Y 3\ where X is hydrogen, Y is hydrogen or a metal, 
R, and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

5 a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 76 or SEQ 

ID NO: 76 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 75; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 75; 

10 d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 

NO: 75; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 75; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
15 NO: 75; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 75; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e), (f)or(g); and 

20 0 an isolated polynucleotide that hybridizes under stringent conditions to SEQ 

ED NO: 75 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

20. An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) SEQ ID NO: 42 or a degenerate variant thereof; 
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b) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 42; 

c) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 42; 

d) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 42; 

e) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 42; 

f) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 42; 

g) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),or(f);and 

h) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 42 and encodes an acyl CoAxholesterol acyltransferase-like 
polypeptide. 


An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-(R,)„-(R 2 ) n -(R 3 ) II -Y 3\ where X is hydrogen, Y is hydrogen or a metal R, 
and Rj are any nucleic acid, n is an integer between 0 and 3000, and R 2 is selected 
from the group consisting of: 

a) SEQ ID NO: 42 or degenerate variants thereof; 

b) an isolated polynucleotide having at least 70% sequence identity to SEQ ID 
NO: 42; 

c) an isolated polynucleotide having at least 80% sequence identity to SEQ ID 
NO: 42; 

d) an isolated polynucleotide having at least 90% sequence identity to SEQ ID 
NO: 42; 

e) an isolated polynucleotide having at least 95% sequence identity to SEQ ID 
NO: 42; 

f) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 42; 

g) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),or(f);and 


_O11630SA2J_> 


WO 01/16308 PCT/OS00/23863 

61 

h) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 42 and encodes an acyl CoAxholesterol acyltransferase-like 
20 polypeptide. 

22. A recombinant nucleic acid construct comprising a regulatory sequence operably 
linked to polynucleotide encoding a Iecithinxholesterol acyltransferase-like 
polypeptide or a fragment thereof. 

23. The recombinant nucleic acid construct of claim 22, wherein said lecithinxholesterol 
acyltransferase-like polypeptide is a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

24. A recombinant nucleic acid construct comprising a regulatory sequence operably 
linked to a polynucleotide encoding an acyl CoAxholesterol acyltransferase-like 
polypeptide. 


25. The recombinant nucleic acid construct of claim 24, wherein said acyl 
CoAxholesterol acyltransferase-like polypeptide is a plant acyl CoAxholesterol 
acyltransferase-like polypeptide. 

26. The recombinant construct of claim 22, wherein said regulatory sequence comprises a 
heterologous regulatory sequence. 

27. The recombinant construct of claim 24, wherein said regulatory sequence comprises a 
heterologous regulatory sequence. 

28. The recombinant construct of claim 22, wherein said regulatory sequence is functional 
in a plant cell. 

29. The recombinant construct of claim 24, wherein said regulatory sequence is functional 
in a plant cell. 

30. Hie recombinant construct of claim 22, further comprising a termination sequence. 
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31. The recombinant construct of claim 24 further comprising a termination sequence;. 

32. The recombinant construct of claim 22 wherein said polynucleotide is selected from 
the group consisting of SEQED NO: 2,4, 6, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 43, 44, 45, 46, 47, 48, 49, 50, 51, 73 and 75. 

33. The recombinant construct of claim 24, wherein said polynucleotide is selected from 
the group consisting of SEQ ED NO: 33 and 42. 

34. The recombinant construct of claim 22, wherein said regulatory sequence comprises a 
constitutive promoter. 

35. The recombinant construct of claim 24, wherein said regulatory sequence comprises a 
constitutive promoter. 

36. The recombinant construct of claim 22, wherein said regulatory sequence comprises 
an inducible promoter. 

37. The recombinant construct of claim 24, wherein said regulatory sequence comprises 
an inducible promoter. 

38. The recombinant construct of claim 22, wherein said regulatory sequence is selected 
from the group consisting of a tissue specific promoter, a developmentally regulated 
promoter, an organelle specific promoter, and a seed specific promoter. 

39. The recombinant construct of claim 24, wherein said regulatory sequence is selected 
from the group consisting of a tissue specific promoter, a developmentally regulated 
promoter, an organelle specific promoter, and a seed specific promoter. 

40. - A host cell containing the recombinant construct of claim 22 or 24. 
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41. The host cell of claim 40, wherein said host cell is selected from the group consisting 
of plant cells, animal cells, insect cells, yeast, bacteria, bacteriophage and viruses. 

42. The host cell of claim 40, wherein said host cell is a plant cell. 

43. The host cell of claim 40, wherein said host cell expresses a lecithinxholesterol 
acyltransferase-like polypeptide or an acyl CoAxholesterol acyltransferase-like 
polypeptide. 

44. The host cell of claim 43, wherein said cholesterol acyltransferase-like polypeptide is 
a plant acyltransferase-like polypeptide. 

45. A plant comprising at least one host cell of claim 40. 

46. The progeny of a plant of claim 45. 

47. A seed from the plant of claim 45. 

48. A plant comprising the recombinant construct of claim 22 or 24. 

49. The progeny of a plant of claim 48. 

50. A seed from the plant of claim 48. 

51. A purified polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 3, SEQ ID NO: 3 with at least one conservative amino acid 
substitution, SEQ ID NO: 5, SEQ ID NO: 5 with at least one conservative amino acid 
substitution, SEQ ID NO: 7, SEQ ID NO: 7 with at least one conservative amino acid 
substitution, SEQ ID NO: 9, SEQ ID NO: 9 with at least one conservative amino acid 
substitution, SEQ ED NO: 74, SEQ ID NO: 74 with at least one conservative amino 
acid substitution, SEQ ID NO: 76 and SEQ ID NO: 76 with at least one conservative 
amino acid substitution. 
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52. A purified immunogenic polypeptide comprising at least 1 0 consecutive amino acids 
from an amino acid sequence selected from the group consisting of SEQ ID NO: 3, 
SEQ ID NO: 3 with at least one conservative amino acid substitution, SEQ ID NO: 5, 
SEQ ID NO: 5 with at least one conservative amino acid substitution, SEQ ID NO: 7, 
SEQ ID NO: 7 with at least one conservative amino acid substitution, SEQ ID NO: 9, 
SEQ ID NO: 9 with at least one conservative amino acid substitution, SEQ ID NO: 
74, SEQ ID NO: 74 with at least one conservative amino acid substitution, SEQ ID 
NO: 76 and SEQ ID NO: 76 with at least one conservative amino acid substitution. 


53. An antibody which specifically binds to an immunogenic polypeptide of claim 52. 

54. A method for producing a lecithinxholesterol acyltransferase-like polypeptide or an 
acyl CoAxholesteroI acyltransferase-like polypeptide comprising culturing a host cell 
of claim 40 under conditions permitting expression of said lecithinxholesterol 
acyltransferase-like polypeptide or acyl CoAxholesteroI acyltransferase-like 
polypeptide. 

55. The method of claim 54, further comprising isolating the cholesterol acyltransferase- 
like polypeptide from the host cell or from the medium in which the host cell is 
cultured. 

56. A method for modifying the sterol content of a host cell, comprising transforming a 
host cell with a recombinant construct containing a regulatory sequence operably 
linked to a polynucleotide encoding a lecithinrcholesterol acyltransferase-like 
polypeptide and culturing said host cell under conditions wherein said host cell 
expresses a lecithinxholesterol acyltransferase-like polypeptide such that said host 
cell has a modified stool composition as compared to host cells without the 
recombinant construct. 

57. The method of claim 56, wherein said lecithinxholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 
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58. A method for modifying the sterol content of a host cell, comprising transforming a 
host cell with a recombinant construct containing a regulatory sequence operably 
linked to a polynucleotide encoding an acyl CoAxholesterol acyltransferase-like 
polypeptide and culturing said host cell under conditions wherein said host cell 

5 expresses an acyl CoAxholesterol acyltransferase-like polypeptide such that said host 

cell has a modified sterol composition as compared to host cells without the 
recombinant construct. 

59. The method of claim 58, wherein said acyl CoAxholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 

60. The method of claim 56, wherein said modified sterol composition is an increase in 
sterol esters. 

61. The method of claim 58, wherein said modified sterol composition is an increase in 
sterol esters. 

62. The method of claim 56, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ ID NO: 
2,4,6,8, 10, 11, 73 and 75. 

63.. The method of claim 58 wherein said polynucleotide encoding a acyl CoAxholesterol 
acyltransferase-like polypeptide is SEQ ID NO 33 OR 42. 

64. The method of claim 56, wherein said regulatory sequence comprises a constitutive 
promoter. 

65. The method of claim 58, wherein said regulatory sequence comprises a constitutive 
promoter. 

66. The method of claim 56, wherein said regulatory sequence is an inducible promoter. 

67. The method of claim 58, wherein said regulatory sequence is an inducible promoter. 
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68. The method of claim 56, wherein said regulatory sequence is a tissue specific 
promoter. 

69. The method of claim 58, wherein said regulatory sequence is a tissue specific 
promoter. 

70. The method of claim 56, wherein said regulatory sequence is a seed specific promoter. 

71 . The method of claim 58, wherein said regulatory sequence is a seed specific promoter. 

72. The method of claim 56, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

73. The method of claim 58, wherein said polynucleotide encoding an acyl 
CoAxholestero] acyltransferase-like polypeptide is in the antisense orientation. 

74. The method of claim 72, wherein said modified sterol composition is a decrease in 
sterol esters. 

75. The method of claim 73, wherein said modified sterol composition is a decrease in 
sterol esters. 

76. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in modified 
sterol composition of said plant as compared to the same plant without said 

5 recombinant construct. 

77. The plant of claim 76, wherein said lecithinxholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 
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78. The plant of claim 76, wherein said polynucleotide encoding a lecithin:cholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ ID NO: 
2,4, 6, 8, 10, 11,73 and 75. 

79. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding an acyl CoAxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in modified 
sterol composition of said plant as compared to the same plant without said 
recombinant construct. 

80. The plant of claim 79, wherein said acyl CoAxholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 

81 . The plant of claim 79, wherein said polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide is SEQ ID NO: 33 or 42. 

82* The plant of claim 76, wherein said regulatory sequence comprises a tissue specific 
promoter. 

83. The plant of claim 79, wherein said regulatory sequence comprises a tissue specific 
promoter 

84. The plant of claim 76, wherein said regulatory sequence comprises a seed specific 
promoter. 

85. The plant of claim 79, wherein said regulatory sequence comprises a seed specific 

•> 

promoter. 

86. The plant of claim 76, wherein said modified sterol composition is an increase in 
sterol esters. 

87. The plant of claim 79, wherein said modified sterol composition is an increase in 
sterol esters. 
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88* The plant of claim 76, wherein the polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

89. The plant of claim 79, wherein the polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

90. An oil obtained from the plant of claim 76 or 79. 

91. A method for producing an oil with a modified sterol composition comprising, 
providing a plant of claim 76 or 79 and extracting the oil from said plant. 

92. An oil produced by the method of claim 91. 

93. A method for altering oil production by a host cell comprising, transforming a host 
cell with a recombinant construct containing a regulatory sequence operably linked to 
a polynucleotide encoding a lecithinxholesterol acyltransferase-like polypeptide and 
culturing said host cell under conditions wherein said host cell expresses a 

5 lecithinrcholesterol acyltransferase-like polypeptide such that said host cell has an 

altered oil production as compared to host cells without the recombinant construct. 

94. The method of claim 93, wherein said lecithinrcholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 

95. A method for altering oil production by a host cell comprising, transforming a host 
cell with a recombinant construct containing a regulatory sequence operably linked to 
a polynucleotide encoding an acyl CoAxholesterol acyltransferase-like polypeptide 
and culturing said host cell under conditions wherein said host cell expresses an acyl 

5 CoAxholesterol acyltransferase-like polypeptide such that said host cell has an altered 

oil production as compared to host cells without the recombinant construct. 

96. The method of claim 95, wherein said acyl CoAxholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 
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97. The method of claim 93, wherein said oil production is increased. 

98. The method of claim 95, wherein said oil production is increased. 

99. The method of claim 93, wherein said host cell is a plant cell. 

1 00. The method of claim 95, wherein said host cell is a plant cell. 

101. The method of claim 93, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ ED NO: 
2,4,6,8, 10, 11, 73 and 75. 

1 02. The method of claim 95, wherein said polynucleotide encoding an acyl 
CoArcholesterol acyltransferase-like polypeptide is SEQ ID NO: 33 or 42. 

1 03. The method of claim 93, wherein said regulatory sequence is a tissue specific 
promoter. 

1 04. The method of claim 95, wherein said regulatory sequence is a tissue specific 
promoter. 

1 05. The method of claim 93, wherein sfcid regulatory sequence is a seed specific promoter. 

106. The method of claim 95, wherein said regulatory sequence is a seed specific promoter. 

1 07. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in an altered 
production of oil by said plant as compared to the same plant without said "~ 

5 recombinant construct 
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108. The plant of claim 107, wherein said Iecithin:cholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 

1 09. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding an acyl CoAxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in an altered 
production of oil by said plant as compared to the same plant without said 
recombinant construct. 


1 1 0. The plant of claim 1 09, wherein said acyl CoAxhoiesterol acyltransferase-like 
polypeptide is a plant acyl CoArcholesterol acyltransferase-like polypeptide. 

111. The plant of claim 1 07, wherein said oil production is increased. 

112. The plant of claim 109, wherein said oil production is increased. 

113. The plant of claim 107, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ ID NO: 
2, 4, 6, 8,10,11,73 and 75. 

114. The plant of claim 109, wherein said polynucleotide encoding an acyl 
CoArcholesterol acyltransferase-like polypeptide is SEQ ID NO: 33 or 42. 

1 i 5. The plant of claim 107, wherein said regulatory sequence is a tissue specific promoter. 

116. The plant of claim 109, wherein said regulatory sequence is a tissue specific promoter. 

1 1 7. The plant of claim 107, wherein said regulatory sequence is a seed specific promoter. 

118. The plant of claim 109, wherein said regulatory sequence is a seed specific promoter. 

119. A food product comprising the oil of claim 90 or 92. 
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120. A food product comprising the plant of claim 107 or 109. 
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SEQUENCE LISTING 

<110> Monsanto Company 

<120> PLANT STEROL AC YLTRANS FERAS ES 

<130> MTC6718 

5 <140> 
<141> 

<150> 60/152,493 
<151> 1999-08-30 

<160> 80 

10 <170> Patentln Ver. 2.1 

<210> 1 
<211> 440 
<212> PRT 

<213> Homo sapiens 
15 <400> 1 

Met Gly Pro Pro Gly Ser Pro Trp Gin Trp Val Thr Leu Leu Leu Gly 
1 5 10 15 

Leu Leu Leu Pro Pro Ala Ala Pro Phe Trp Leu Leu Asn Val Leu Phe 
20 25 30 

20 Pro Pro His Thr Thr Pro Lys Ala Glu Leu Ser Asn His Thr Arg Pro 
35 40 45 

Val lie Leu Val Pro Gly Cys Leu Gly Asn Gin Leu Glu Ala Lys Leu 
50 55 60 

Asp Lys Pro Asp Val Val Asn Trp Met Cys Tyr Arg Lys Thr Glu Asp 
25 65 70 75 80 

Phe Phe Thr He Trp Leu Asp Leu Asn Met Phe Leu Pro Leu Gly Val 
85 90 95 

Asp Cys Trp He Asp Asn Thr Arg Val Val Tyr Asn Arg Ser Ser Gly 
100 105 110 

30 Leu Val Ser Asn Ala Pro Gly Val Gin He Arg Val Pro Gly Phe Gly 
115 120 125 

Lys Thr Tyr Ser Val Glu Tyr Leu Asp Ser Ser Lys Leu Ala Gly Tyr 
130 . 135 140 

Leu His Thr Leu Val Gin Asn Leu Val Asn Asn Gly Tyr Val Arg Asp 
35 145 ISO 155 160 

Glu Thr Val Arg Ala Ala Pro Tyr Asp Trp Arg Leu Glu Pro Gly Gin 
165 170 175 
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Gin Glu Glu Tyr Tyr Arg Lys Leu Ala Gly Leu Val Glu Glu Met His 
180 185 190 

Ala Ala Tyr Gly Lys Pro Val Phe Leu He Gly His Ser Leu Gly Cys 
195 200 205 

5 Leu His Leu Leu Tyr Phe Leu Leu Arg Gin Pro Gin Ala Trp Lys Asp 
210 215 220 

Arg Phe He Asp Gly Phe He Ser Leu Gly Ala Pro Trp Gly Gly Ser 
225 230 235 240 

He Lys Pro Met Leu Val Leu Ala Ser Gly Asp Asn Gin Gly He Pro 
10 245 250 255 

He Met Ser Ser He Lys Leu Lys Glu Glu Gin Arg lie Thr Thr Thr 
260 265 270 

Ser Pro Trp Met Phe Pro Ser Arg Met Ala Trp Pro Glu Asp His Val 
275 280 285 

15 Phe lie Ser Thr Pro Ser Phe Asn Tyr Thr Gly Arg Asp Phe Gin Arg 
290 295 300 

Phe Phe Ala Asp Leu His Phe Glu Glu Gly Trp Tyr Met Trp Leu Gin 
305 310 315 320 

Ser Arg Asp Leu Leu Ala Gly Leu Pro Ala Pro Gly Val Glu Val Tyr 
20 325 330 335 

Cys Leu Tyr Gly Val Gly Leu Pro Thr Pro Arg Thr Tyr He Tyr Asp 
340 345 350 

His Gly Phe Pro Tyr Thr Asp Pro Val Gly Val Leu Tyr Glu Asp Gly 
355 360 365 

25 Asp Asp Thr Val Ala Thr Arg Ser Thr Glu Leu Cys Gly Leu Trp Gin 
370 375 380 

Gly Arg Gin Pro Gin Pro Val His Leu Leu Pro Leu His Gly He Gin 
385 390 395 400 

His Leu Asn Met Val Phe Ser Asn Leu Thr Leu Glu His He Asn Ala 
30 405 410 415 

He Leu Leu Gly Ala Tyr Arg Gin Gly Pro Pro Ala Ser Pro Thr Ala 
420 425 430 

Ser Pro Glu Pro Pro Pro Pro Glu 
435 440 
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<210> 2 
<211> 1299 
<212> DNA 

<213> Arabidopsis thaliana 

5 <400> 2 

atgaaaaaaa tatcttcaca ttattcggta 
acctcgatgt gtcaagctgt gggtagcaac 
ggaggtaacc agctagaggt acggctggac 
agcagctggt tatatccgat tcataagaag 

10 gcagcagtgt tattgtctcc cttcaccagg 
gaccctgatt tggatgatta ccaaaatgct 
ggttcgacca aatcacttct atacctcgac 
gaacatttgg tgaaagctct agagaaaaaa 
ggagctccat atgatttcag gtacggcctg 

15 tcacagttcc tacaagacct caaacaattg 
aagccagtga tactcctctc ccatagccta 
cgtaccaccc cttcatggcg ccgcaagtac 
^9999^999* cgatctctca gatgaagaca 
ttagttaacc ctttgctggt cagacggcat 

20 cttccatcta ccaaagtgtt tcacgacaga 
aactacacag cttacgagat ggatcggttt 
gtgccttaca agacaagagt gttgccttta 
gtcacttgca tatatgggag aggagttgat 
ggattcgata agcaaccaga gattaagtat 

25 agcttagcag ctttgaaagt cgatagcttg 
acatctatac ttaaagacga gatcgcactt 
aattatgaat tagccaatgt taatgccgtc 


gtcatagcga tactcgttgt ggtgacgatg 60 
gtgtaccctt tgattctggt tccaggaaac 120 
agagaataca agccaagtag tgtctggtgt 180 
agtggtggat ggtttaggct atggttcgat 240 
tgcttcagcg atcgaatgat gttgtactat 300 
cctggtgtcc aaacccgggt tcctcatttc 360 
cctcgtctcc gagatgccac atcttacatg 420 
tgcgggtatg ttaacgacca aaccatccta 480 
gctgcttcgg gccacccgtc ccgtgtagcc 540 
gtggaaaaaa ctagcagcga gaacgaagga 600 
ggaggacttt tcgtcctcca tttcctcaac 660 
atcaaacact ttgttgcact cgctgcgcca 720 
tttgcttctg gcaacacact cggtgtccct 780 
cagaggacct ccgagagtaa ccaatggcta 840 
actaaaccgc ttgtcgtaac tccccaggtt 900 
tttgcagaca ttggattctc acaaggagtt 960 
acagaggagc tgatgactcc gggagtgcca 1020 
acaccggagg ttttgatgta tggaaaagga 1080 
ggagatggag atgggacggt taatttggcg 1140 
aacaccgtag agattgatgg agtttcgcat 1200 
aaagagatta tgaagcagat ttcaattatt 1260 
aatgaatga 1299 


<210> 3 
<211> 432 
30 <212> PRT 

<213> Arabidopsis thaliana 

<400> 3 

Met Lys Lys lie Ser Ser His Tyr Ser Val Val He Ala He Leu Val 
15 10 15 

35 Val Val Thr Met Thr Ser Met Cys Gin Ala Val Gly Ser Asn Val Tyr 
20 25 30 

Pro Leu He Leu Val Pro Gly Asn. Gly Gly Asn Gin Leu Glu Val Arg 
35 40 45 

Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val Trp Cys Ser Ser Trp Leu 
40 50 55 60 

Tyr Pro lie His Lys Lys Ser Gly Gly Trp Phe Arg Leu Trp Phe Asp 
65 70 75 80 

Ala Ala Val Leu Leu Ser Pro Phe Thr Arg Cys Phe Ser Asp Arg Met 
85 90 95 

45 Met Leu Tyr Tyr Asp Pro Asp Leu Asp Asp Tyr Gin Asn Ala Pro Gly 
100 105 no 
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Val Gin Thr Arg Val Pro His Phe Gly Ser Thr Lys Ser Leu Leu Tyr 
115 120 125 

Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser Tyr Met Glu His Leu Val 
130 135 140 

5 Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val Asn Asp Gin Thr He Leu 
145 150 155 160 

Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu Ala Ala Ser Gly His Pro 
165 170 175 

Ser Arg Val Ala Ser Gin Phe Leu Gin Asp Leu Lys Gin Leu Val Glu 
10 180 185 190 

Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro Val He Leu Leu Ser His 
"5 200 205 

Ser Leu Gly Gly Leu Phe Val Leu His Phe Leu Asn Arg Thr Thr Pro 
210 215 220 

15 Ser Trp Arg Arg Lys Tyr He Lys His Phe Val Ala Leu Ala Ala Pro 
225 230 235 240 

Trp Gly Gly Thr He Ser Gin Met Lys Thr Phe Ala Ser Gly Asn Thr 
245 250 255 

Leu Gly Val Pro Leu Val Asn Pro Leu Leu Val Arg Arg His Gin Arg 
20 260 265 270 

Thr Ser Glu Ser Asn Gin Trp Leu Leu Pro Ser Thr Lys Val Phe His 
2? 5 280 285 

Asp Arg Thr Lys Pro Leu Val Val Thr Pro Gin Val Asn Tyr Thr Ala 
290 295 300 

25 Tyr Glu Met Asp Arg Phe Phe Ala Asp He Gly Phe Ser Gin Gly Val 
305 310 315 320 

Val Pro Tyr Lys Thr Arg Val Leu Pro Leu Thr Glu Glu Leu Met Thr 
325 330 335 

Pro Gly Val Pro Val Thr Cys He Tyr Gly Arg Gly Val Asp Thr Pro 
30 340 345 350 

Glu Val Leu Met Tyr Gly Lys Gly Gly Phe Asp Lys Gin Pro Glu He 
355 360 365 

Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn Leu Ala Ser Leu Ala Ala 
370 375 380 

35 Leu Lys Val Asp Ser Leu Asn Thr Val Glu He Asp Gly Val Ser His 
385 390 395 400 

Thr Ser lie Leu Lys Asp Glu He Ala Leu Lys Glu He Met Lys Gin 
405 410 415 
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He Ser He He Asn Tyr Glu Leu Ala Asn Val Asn Ala Val Asn Glu 
420 425 430 


<210> 4 
5 <211> 1641 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 4 

atgggagcga attcgaaatc agtaacggct tccttcaccg tcatcgccgt ttttttcttg 60 

10 atttgcggtg gccgaactgc ggtggaggat gagaccgagt ttcacggcga ctactcgaag 120 
ctatcgggta taatcattcc gggatttgcg tcgacgcagc tacgagcgtg gtcgatcctt 180 
gactgtccat acactccgtt ggacttcaat ccgctcgacc tcgtatggct agacaccact 240 
aagcttcttt ctgctgtcaa ctgctggttt aagtgtatgg tgctagatcc ttataatcaa 300 
acagaccatc ccgagtgtaa gtcacggcct gacagtggtc tttcagccat cacagaattg 360 

15 gatccaggtt acataacagg tcctctttct actgtctgga aagagtggct taagtggtgt 420 
gttgagtttg gtatagaagc aaatgcaatt gtcgctgttc catacgattg gagattgtca 4 80 
ccaaccaaat tggaagagcg tgacctttac tttcacaagc tcaagttgac ctttgaaact 540 
gctttaaaac tccgtggcgg cccttctata gtatttgccc attcaatggg taataatgtc 600 
ttcagatact -ttctggaatg gctgaggcta gaaattgcac caaaacatta tttgaagtgg 660 

20 cttgatcagc atatccatgc ttatttcgct gttggagctc ctcttcttgg ttctgttgag 720 
gcaatcaaat ctactctctc tggtgtaacg tttggccttc ctgtttctga gggaactgct 780 
cggttgttgt ccaattcttt tgcgtcgtca ttgtggctta tgccattttc aaagaattgc 840 
aagggtgata acacatcctg gacgcatttt tctgggggtg ctgcaaagaa agataagcgc 900 
gtataccact gtgatgaaga ggaatatcaa tcaaaatatt ctggctggcc gacaaatatt 960 

25 attaacattg aaattccttc cactagcgtt acagaaacag ctctagtcaa catgaccagc 1020 
atggaatgtg gccttcccac ccttttgtct ttcacagccc gtgaactagc agatgggact 1080 
cttttcaaag caatagaaga ctatgaccca gatagcaaga ggatgttaca ccagttaaag 1140 
aagttgtatc atgatgaccc tgtttttaat cctctgactc cttgggagag accacctata 1200 
aaaaatgtat tttgcatata tggtgctcat ctaaagacag aggttggtta ttactttgcc 1260 

30 ccaagtggca aaccttatcc tgataattgg atcatcacgg atatcattta cgaaactgaa 1320 
ggttccctcg tgtcaaggtc tggaactgtg gttgatggga acgctggacc tataactggg 1380 
gatgagacgg taccctatca ttcactctct tggtgcaaga attggctcgg acctaaagtt 1440 
aacataacaa tggctcccca gccagaacac gatggaagcg acgtacatgt ggaactaaat 1500 
gttgatcatg agcatgggtc agacatcata gctaacatga caaaagcacc aagggttaag 1560 

35 tacataacct tttatgaaga ctctgagagc attccgggga agagaaccgc agtctgggag 1620 
cttgataaaa gtgggtatta a 1641 


<210> 5 
<211> 546 
<212> PRT 
40 <213> Arabidopsis thaliana 

<400> 5 

Met Gly Ala Asn Ser Lys Ser Val Thr Ala Ser Phe Thr Val He Ala 
1 5 10 15. 

Val Phe Phe Leu He Cys Gly Gly, Arg Thr Ala Val Glu Asp Glu Thr 
45 20 25 30 

Glu Phe His Gly Asp Tyr Ser Lys Leu Ser Gly He He He Pro Gly 
3S 40 45 
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Phe Ala Ser Thr Gin Leu Arg Ala Trp Ser lie Leu Asp Cys Pro Tyr 
50 55 eo 

Thr Pro Leu Asp Phe Asn Pro Leu Asp Leu Val Trp Leu Asp Thr Thr 
65 7 ° 75 80 

5 Lys Leu Leu Ser Ala Val Asn Cys Trp Phe Lys Cys Met Val Leu Asp 

85 go 95 

Pro Tyr Asn Gin Thr Asp His Pro Glu Cys Lys Ser Ar 9 Pro Asp Ser 
1°° 105 no 

Gly Leu Ser Ala lie Thr Glu Leu Asp Pro Gly Tyr He Thr Gly Pro 
10 11S 120 125 

Leu Ser Thr Val Trp Lys Glu Trp Leu Lys Trp Cys Val Glu Phe Gly 
130 135 140 

He Glu Ala Asn Ala He Val Ala Val Pro Tyr Asp Trp Arg Leu Ser 
145 150 155 160 

15 Pro Thr Lys Leu Glu Glu Arg Asp Leu Tyr Phe His Lys Leu Lys Leu 

165 170 ' 175 

Thr Phe Glu Thr Ala Leu Lys Leu Arg Gly Gly Pro Ser He Val Phe 
180 185 190 

Ala His Ser Met Gly Asn Asn Val Phe Arg Tyr Phe Leu Glu Trp Leu 
20 195 200 205 

Arg Leu Glu He Ala Pro Lys His Tyr Leu Lys Trp Leu Asp Gin His 
21° 215 220 

He His Ala Tyr Phe Ala Val Gly Ala Pro Leu Leu Gly ser Val Glu 
225 230. 235 240 

25 Ala He Lys Ser Thr Leu Ser Gly Val Thr Phe Gly Leu Pro Val Ser 

245 250 255 

Glu Gly Thr Ala Arg Leu Leu Ser Asn Ser Phe Ala Ser Ser Leu Trp 
26 <> 265 270 

Leu Met Pro Phe Ser Lys Asn Cys Lys Gly Asp Asn Thr Ser Trp Thr 
30 27 5 280 285 

His Phe Ser Gly Gly Ala Ala Lys Lys Asp Lys Arg Val Tyr His Cys 
290 295 300 

Asp Glu Glu Glu Tyr Gin Ser Lys Tyr Ser Gly Trp Pro Thr Asn He 
305 310 315 320 

35 He Asn lie Glu He Pro Ser Thr Ser Val Thr Glu Thr Ala. Leu Val 

325 330 335 

Asn Met Thr Ser Met Glu Cys Gly Leu Pro Thr Leu Leu Ser Phe Thr 
340 345 350 
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Ala Arg Glu Leu Ala Asp Gly Thr Leu Phe Lys Ala He Glu Asp Tyr 
355 360 365 

Asp Pro Asp Ser Lys Arg Met Leu His Gin Leu Lys Lys Leu Tyr His 
370 375 380 

5 Asp Asp Pro Val Phe Asn Pro Leu Thr Pro Trp Glu Arg Pro Pro He 
385 390 395 400 

Lys Asn Val Phe Cys He Tyr Gly Ala His Leu Lys Thr Glu Val Gly 
405 410 4is 

Tyr Tyr Phe Ala Pro Ser Gly Lys Pro Tyr Pro Asp Asn Trp He lie 
10 420 425 430 

Thr Asp He He Tyr Glu Thr Glu Gly Ser Leu Val Ser Arg Ser Gly 
435 440 445 

Thr Val Val Asp Gly Asn Ala Gly Pro He Thr Gly Asp Glu Thr Val 
450 455 460 

15 Pro Tyr His Ser Leu Ser Trp Cys Lys Asn Trp Leu Gly Pro Lys Val 
465 470 475 480 

Asn He Thr Met Ala Pro Gin Pro Glu His Asp Gly Ser Asp Val His 
485 490 495 

Val Glu Leu Asn Val Asp His Glu His Gly Ser Asp He He Ala Asn 
20 500 505 510 

Met Thr Lys Ala Pro Arg Val Lys Tyr He Thr Phe Tyr Glu Asp Ser 
515 520 525 

Glu Ser He Pro Gly Lys Arg Thr Ala Val Trp Glu Leu Asp Lys Ser 
530 535 540 

25 Gly Tyr 
545 


<210> 6 
<211> 1608 
<212> DNA 
30 <213> Arabidopsis thaliana 

<400> 6 

atgtctctat tactggaaga gatcattaga tcagtagagg ctttgctgaa gctcagaaat 60 
cggaatcaag aaccctatgt tgacccgaat ctaaacccgg ttcttctagt tccaggaatc 120 
gctggatcaa ttctaaacgc cgttgatcat gagaacggga acgaagaacg tgtttgggtt 180 

35 aggatctttg gtgctgatca tgagtttcga acaaagatgt ggtctcgatt tgatccttca 240 
actggtaaaa cgatatctct tgatccaaaa acgagtattg ttgttcctca agacagagct 300 
gggctacatg caattgatgt cttagaccct gatatgattg ttggccgtga gtctgtgtac 360 
tatttccatg agatgattgt tgagatgatc ggatggggat ttgaagaagg gaaaaccctt 420 
tttggttttg gttatgattt ccgccaaagc aacagactgc aggaaacgtt ggaccagttt 480 

40 gctaaaaagt tggaaactgt ttataaagcc tcaggagaga agaagattaa tgttattagt 540 
cattctatgg gaggactatt ggtgaaatgt ttcatgggtc tgcatagtga tatattcgag 600 
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aagtatgtac agaattggat tgctattgct 
acatcgactt tattgaatgg aatgtcgttt 
tctaagtgga gcatgcatca gctgcttatt 
tgtccgtatt ttaaatggga gctccctccc 
5 aatgatggag ttggaacctc tgatgttgtt 
gaagttttta cgaaatctct ttcgaataat 
cttcctttta actggaagat catggagtgg 
gccaagctgc ctccgaaagt taaattctat 
catagtgttt gctatgggaa tgagaagatg 

10 ttccagccga catatatatg cgtggatggt 
gcggatgggc ttgaagcagt agcaagagtt 
aacgatcacc gtgtcttccg aatgctcaaa 
ttctacaacc cagtaaacga ttatgtcatc 
catgagaatg gactcgaggt tgcttccgtg 

15 aacaatatcg gcacaaccgg gtcaaccgtg 
gatcaaaacc ctcaagctga agctcgtgca 
agacaacatg tagagctcaa tgctgtaagt 


gctccatttc gaggtgctcc tggatatatc 660 
gtcaatggtt gggaacagaa ctttttcgtc 720 
gagtgtccat ccatatatga gctgatgtgt 780 
gtcttagagc tgtggagaga gaaagagagc 840 
cttgagtctt accgtagcct ggagagcctt 900 
acagctgatt attgtggaga gtcgatcgat 960 
gctcacaaaa ccaagcaagt attagcctct 1020 
aacatatatg ggaccaatct agaaacccct 1080 
cccgttaaag atctaacgaa tctaagatac 1140 
gatggcacag tcccgatgga atctgccatg 1200 
ggagtccctg gtgagcaccg aggaatcctc 1260 
aaatggctaa atgtaggcga accagacccg 1320 
cttcccacca catatgaatt tgagaaattc 1380 
aaagaatcgt gggacatcat atcagatgac 1440 
aactccatat cagtctctca acctggagat 1500 
accttaaccg tccaaccaca aagcgatggt 1560 
gtctctgttg atgcataa 1608 


<210> 7 
<211> 535 
20 <212> PRT 

<213> Arabidopsis thaliana 

<400> 7 

Met Ser Leu Leu Leu Glu Glu He He Arg Ser Val Glu Ala Leu Leu 
1 5 10 is 

25 Lys Leu Arg Asn Arg Asn Gin Glu Pro Tyr Val Asp Pro Asn Leu Asn 
20 25 30 

Pro Val Leu Leu Val Pro Gly He Ala Gly Ser He Leu Asn Ala Val 
35 40 45 

Asp His Glu Asn Gly Asn Glu Glu Arg Val Trp Val Arg He Phe Gly 
30 50 55 60 

Ala Asp His Glu Phe Arg Thr Lys Met Trp Ser Arg Phe Asp Pro Ser 
65 70 75 80 

Thr Gly Lys Thr He Ser Leu Asp Pro Lys Thr Ser He Val Val Pro 
85 90 95 

35 Gin Asp Arg Ala Gly Leu His Ala He Asp Val Leu Asp Pro Asp Met 
100 105 no 

He Val Gly Arg Glu Ser Val Tyr Tyr Phe His Glu Met He Val Glu 
115 120 125 

Met He Gly Trp Gly Phe Glu Glu Gly Lys Thr Leu Phe Gly Phe Gly 
40 130 135 140 

Tyr Asp Phe Arg Gin Ser Asn Arg Leu Gin Glu Thr Leu Asp Gin Phe 
145 150 155 160 
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Ala Lys Lys Leu Glu Thr Val Tyr Lys Ala Ser Gly Glu Lys Lys He 
165 170 175 

Asn Val He Ser His Ser Met Gly Gly Leu Leu Val Lys Cys Phe Met 
180 185 190 

5 Gly Leu His Ser Asp He Phe Glu Lys Tyr Val Gin Asn Trp He Ala 
195 200 205 

He Ala Ala Pro Phe Arg Gly Ala Pro Gly Tyr He Thr Ser Thr Leu 
210 215 220 

Leu Asn Gly Met Ser Phe Val Asn Gly Trp Glu Gin Asn Phe Phe Val 
10 225 230 235 240 

Ser Lys Trp Ser Met His Gin Leu Leu He Glu Cys Pro Ser He Tyr 
245 250 255 

Glu Leu Met Cys Cys Pro Tyr Phe Lys Trp Glu Leu Pro Pro Val Leu 
260 265 270 

15 Glu Leu Trp Arg Glu Lys Glu Ser Asn Asp Gly Val Gly Thr Ser Asp 
275 280 285 

Val Val Leu Glu Ser Tyr Arg Ser Leu Glu Ser Leu Glu Val Phe Thr 
290 295 300 

Lys Ser Leu Ser Asn Asn Thr Ala Asp Tyr Cys Gly Glu Ser He Asp 
20 305 310 315 320 

Leu Pro Phe Asn Trp Lys He Met Glu Trp Ala His Lys Thr Lys Gin 
325 330 335 

Val Leu Ala Ser. Ala Lys Leu Pro Pro Lys Val Lys Phe Tyr Asn He 
340 345 350 

25 Tyr Gly Thr Asn Leu Glu Thr Pro His Ser Val Cys Tyr Gly Asn Glu 
355 360 365 

Lys Met Pro Val Lys Asp Leu Thr Asn Leu Arg Tyr Phe Gin Pro Thr 
370 375 380 

Tyr He Cys Val Asp Gly Asp Gly Thr Val Pro Met Glu Ser Ala Met 
30 385 390 395 400 

Ala Asp Gly Leu Glu Ala Val Ala Arg Val Gly Val Pro Gly Glu His 
405 410 415 

Arg Gly He Leu Asn Asp His Arg Val Phe Arg Met Leu Lys Lys Trp 
420 425 430 

35 Leu Asn Val Gly Glu Pro Asp Pro Phe Tyr Asn Pro Val Asn Asp Tyr 
435 140 445 

Val He Leu Pro Thr Thr Tyr Glu Phe Glu Lys Phe His Glu Asn Gly 
450 455 460 
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Leu Glu Val Ala Ser Val Lye Glu Ser Trp Asp lie He Ser Asp Asp 
465 470 475 480 

Asn Asn He Gly Thr Thr Gly Ser Thr Val Asn Ser He Ser Val Ser 
485 490 495 

5 Gin Pro Gly Asp Asp Gin Asn Pro Gin Ala Glu Ala Arg Ala Thr Leu 
500 505 510 

Thr Val Gin Pro Gin Ser Asp Gly Arg Gin His Val Glu Leu Asn Ala 
51S 520 525 

Val Ser Val Ser Val Asp Ala 
10 530 535 


<210> 8 
<211> 1344 
<212> DNA 

<213> Arabidopsis thaliana 
15 <400> 8 

atgggctgga ttccgtgtcc gtgctgggga accaacgacg atgaaaacgc cggcgaggtg 60 
gcggatcgtg atccggtgct tctagtatct ggaattggag gctctattct gcattctaag 120 
aagaagaatt caaagtctga aattcgggtt tgggtccgaa tatttctagc taaccttgcc 180 
tttaagcaga gcctctggtc tctctataat cccaaaactg gttatacaga gccgttggat 240 

20 gataatattg aagtattggt ccctgatgat gaccatggac tctatgcaat tgacattcta 300 
gatccctctt ggtttgtaaa gctttgtcac ttgacggagg tttatcactt tcacgatatg 360 
atagaaatgc tggttggatg cggttataag aaggggacta cattattcgg ttatggttac 420 
gatttccgtc aaagcaatag gatcgatcta cttatactag gtctgaagaa gaagctggaa 480 
actgcatata aacgttcagg ggggagaaaa gtcactatca tctcccattc aatgggagga 540 

25 cttatggttt catgtttcat gtatctccat ccggaggcat tttccaagta tgtaaataaa 600 
tggattacaa ttgcaacacc tttccaagga gcaccagggt gcatcaatga ttcaatcttg 660 
actggagtgc aatttgtgga agggttagaa^ agtttctttt ttgtgtcacg ttggacgatg 720 
caccaactgt tggtcgaatg cccatctata tatgagatga tggcaaatcc agactttaag 780 
tggaaaaagc aaccagagat tcgagtttgg cgtaagaaat ctgaaaacga cgttgatact 840 

30 tctgtagaac tggaatcatt tggcttaatc gagagtattg atctattcaa cgatgcatta 900 
aaaaataacg agctaagcta tggtgggaat aaaatagctt tgccctttaa ctttgctatc 960 
ctcgactggg ctgctaagac aagagaaatt ctcaacaaag cgcaacttcc tgatggagtg 1020 
tccttctata acatatatgg agtgtcactt aatacaccct ttgatgtttg ttatggcaca 1080 
gagacttctc cgatagacga tttgtctgaa atatgtcaaa ctatgcctga gtatacatat 1140 

35 gtagatggag atggaactgt ccctgctgaa tcagctgcag ctgctcagtt taaagcagtt 1200 
gctagcgtag gagtttcggg tagccaccgc gggcttctcc gtgatgaaag agtgtttgag 1260 
ctcattcaac aatggttagg agttgagccc aagaaggcta aacggaagca tttaaggact 1320 
cacaaagtag ttgattctgg ttaa 1344 

<210> 9 
40 <211> 447 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 9 

Met Gly Trp He Pro Cys Pro Cys Trp Gly Thr Asn Asp Asp Glu Asn 
45 1 5 10 15 
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Ala Gly Glu Val Ala Asp Arg Asp Pro Val Leu Leu Val Ser Gly lie 
20 25 30 

Gly Gly Ser lie Leu His Ser Lys Lys Lys Asn Ser Lys Ser Glu lie 
35 40 45 

5 Arg Val Trp Val Arg lie Phe Leu Ma Asn Leu Ala Phe Lys Gin Ser 
50 55 60 

Leu Trp Ser Leu Tyr Asn Pro Lys Thr Gly Tyr Thr Glu Pro Leu Asp 
65 70 75 80 

Asp Asn He Glu Val Leu Val Pro Asp Asp Asp His Gly Leu Tyr Ala 
10 85 90 95 

He Asp He Leu Asp Pro Ser Trp Phe Val Lys Leu Cys His Leu Thr 
100 105 no 

Glu Val Tyr His Phe His Asp Met He Glu Met Leu Val Gly Cys Gly 
115 120 125 

15 Tyr Lys Lys Gly Thr Thr Leu Phe Gly Tyr Gly Tyr Asp Phe Arg Gin 
130 135 140 

Ser Asn Arg He Asp Leu Leu lie Leu Gly Leu Lys Lys Lys Leu Glu 
"5 150 155 160 

Thr Ala Tyr Lys Arg Ser Gly Gly Arg Lys Val Thr He He Ser His 
20 165 170 175 

Ser Met Gly Gly Leu Met Val Ser Cys Phe Met Tyr Leu His Pro Glu 
180 185 190 

Ala Phe Ser Lys Tyr Val Asn Lys Trp He Thr He Ala Thr Pro Phe 
195 200 205 

25 Gin Gly Ala Pro Gly Cys He Asn Asp Ser He Leu Thr Gly Val Gin 
210 215 220 

Phe Val Glu Gly Leu Glu Ser Phe Phe Phe Val Ser Arg Trp Thr Met 
225 230 235 240 

His Gin Leu Leu Val Glu Cys Pro Ser He Tyr Glu Met Met Ala Asn 
30 245 250 255 

Pro Asp Phe Lys Trp Lys Lys Gin Pro Glu He Arg Val Trp Arg Lys 
260 265 270 

Lys Ser Glu Asn Asp Val Asp Thr Ser Val Glu Leu Glu Ser Phe Gly 
275 280 285 

35 Leu He Glu, Ser He Asp Leu Phe Asn Asp Ala Leu Lys Asn Asn Glu 
290 295 300 

Leu Ser Tyr Gly Gly Asn Lys He Ala Leu Pro Phe Asn Phe Ala lie 
305 310 315 320 
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Leu Asp Trp Ala Ala Lys Thr Arg Glu He Leu Asn Lys Ala Gin Leu 
325 330 335 

Pro Asp Gly Val Ser Phe Tyr Asn He Tyr Gly Val Ser Leu Asn Thr 
340 345 350 

5 Prq^Phe Asp Val Cys Tyr Gly Thr Glu Thr Ser Pro lie Asp Asp Leu 

" ViV? 355 360 365 

Ser Glu He Cys Gin Thr Met Pro Glu Tyr Thr Tyr Val Asp Gly Asp 
370 375 360 

Gly Thr Val Pro Ala Glu Ser Ala Ala Ala Ala Gin Phe Lys Ala Val 
10 385 390 395 400 

Ala Ser Val Gly Val Ser Gly Ser His Arg Gly Leu Leu Arg Asp Glu 
405 • 410 415 

Arg Val Phe Glu Leu He Gin Gin Trp Leu Gly Val Glu Pro Lys Lys 
420 425 430 

15 Ala Lys Arg Lys His Leu Arg Thr His Lys Val Val Asp Ser Gly 
435 440 445 


<210> 10 
<211> 3107 
<212> DNA 
20 <213> Arabidopsis t ha liana 

<400> 10 

cctttttgat ctttcagctc aatgagcttt 
aatttcaaag tttccacatc gagtttattc 
tgttatccag ctttgaactc ctcccgaccc 

25 tgtgggttgc atctttgtta cgatctgcat 
tgctatggta gagatgggca atgttattgt 
cttaactaat caattatctc tttgattcag 
tgacaatgaa actgggttgg atccagctgg 
ggctgctgac tactttgctc ctggctactt 

30 acatattgga tatgaagaga aaaatatgta 
tcagaacaca gaggttcttt tctcatcgtt 
ctttcttcat tacttaaggc ttaaatatgt 
actcttagcc gtatgaaaag taatatagag 
gcagttatag ttccgcattc catgggggtc 

35 gaggcaccag ctcctctggg tggcgggggt 
gcggtgatga acattggtgg accatttctt 
tctgctgaag caaaggatgt tgcagttgcc 
gatcagaacc ttggctctgg aactcaaagt 
tgctatatta tcgctgcaac tgacattggt 

40 tctcttgaga ttagacaaat gatgaattga 
gtttcttagc ttcgacgata acatttgtca 
tatggaaagc gctggagctt ccatgcttgc 
tttcaatttt cttgtatatg catcta.ggt 
ggagtagttg ctctattagt cgcttggttc 

45 ttgcagagcg attgccccag gattcttaga 
gcatgtaatg agaatgacac gcacatggga 


tctcaatttt ttgggggaac tgaatatgtg 60 
acacgtcttg aatttcgtcc atcctcgttc 120 
tgctatggat atattaaaaa aaaagtgttt 180 
cttcttcttt cggctcagtg ttcatgtttt 240 
tgatggtaac agtggtatag ttgatagtat 300 
gcctctatgt tgggtggaac acatgtcact 360 
tattagagtt cgagctgtat caggactcgt 420 
tgtctgggca gtgctgattg ctaaccttgc 480 
catggctgca tatgactggc ggctttcgtt 540 
ctttctatta ttctgttcca tgttacgttt 600 
ttcatgttga attaataggt acgtgatcag 660 
ttgatggttt ctaccaatgg tggaaaaaaa 720 
ttgtattttc tacattttat gaagtgggtt 780 
gggccagatt ggtgtgcaaa gtatattaag 840 
ggtgttccaa aagctgttgc agggcttttc 900 
aggtattgaa tatctgctta tacttttgat 960 
tattctacta aatatcaatt ctaataacat 1020 
tgattatttt gctgcttatg taactgaaac 1080 
taattcttac gcattgctct gtgatgacca 1140 
tactgtcttt tggagggcat tgaattttgc 1200 
attctttacc aattagcatt attctgcttc 1260 
cttttatttc ttcttaatta aagactcgtt 1320 
cttaatatag aactttactt tcttcgaaaa 1380 
caccgatata tttagacttc agaccttgca 1440 
ctcaacaatg tctatgttac cgaagggagg 1500 
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tgacacaata tggggcgggc ttgattggtc 
aaagcaaaag aacaacgaaa cttgtggtga 
tcctgttaac tatggaagga tgatatcttt 
tgagattaat aatattgatt ttcgagtaag 
5 tttgtgattg tatgatgaat atctgtacat 
agagtatccc aaatcacacc tgtcgtgacg 
ctgggatcaa agctatcgct gagtataagg 
tacattatgt tgctcctaag atgatggcgc 
ctgatgattt ggatgacacc aagtatcaag 

10 caaagtaagt gatttcttga ttccaactgt 
ttgttttcgg tcttgttgga tatggttttc 
ctttctcaaa aaggcttgct cagttatatt 
gcttataaat cctccgtttg gtttgttctg 
ggaaatctac tcattatacg gagtggggat 

15 taaccagtct cccgacagtt gcatcccctt 
cgaagatagc tgtctgaaag caggagttta 
cctaagtgcc gggtacatgt gtgcaaaagc 
cggaatcaag acttacataa gagaatacaa 
gcgcgggacg cagagtggtg cccatgttga 

20 tatcatgagg gttgccgccg gaggtaacgg 
tggcatattt gaatggtcgg agcgtattga 
agctgtcctg tcagcttatg tgaatccaat 
catcatcgtc atcatcatga tgctcaactc 
cgaaattctc aatacctctt taatattctt 

25 tgtttgaacg ataacgcaaa acttgctgcg 
atttgtgggt tatacgtagt gtagaggatg 
agttaattct gaaaatgcaa caccacatga 


accggagaaa ggccacacct gttgtgggaa 1560 
agcaggtgaa aacggagttt ccaagaaaag 1620 
tgggaaagaa gtagcagagg ctgcgccatc 1680 
gacatataaa tcataataaa ccttgtacat 1740 
tttatctggt gaagggtgct gtcaaaggtc 1800 
tgtggacaga gtaccatgac atgggaattg 1860 
tctacactgc tggtgaagct atagatctac 1920 
gtggtgccgc tcatttctct tatgggattg 1980 
atcccaaata ctggtcaaat ccgttagaga 2040 
atccttcgtc ctgatgcatt atcagtcttt 2100 
agctcaaagc ttacaaagct gtttctgagc 2160 
gaggtgctaa agttgataca tgtgactctt 2220 
ctttttcaga ttaccgaatg ctcctgagat 2280 
accaacggaa cgagcatacg tatacaagct 2340 
tcagatattc acttctgctc acgaggagga 2400 
caatgtggat ggggatgaaa cagtacctgt 2460 
gtggcgtggc aagacaagat tcaacccttc 2520 
tcactctccg ccggctaacc tgttggaagg 2580 
tatcatggga aactttgctt tgatcgaaga 2640 
gtctgatata ggacatgacc aggtccactc 2700 
cctgaagctg tgaatatcat gatctcttta 2760 
actttgaaag agagatcatc atcaattcat 2820 
acaaagaagc ctgagaatga tactttggtg 2880 
attgaatgta aattatacaa tcctatctaa 2940 
ccatgtttgt ttgtcttgtc aaaagcatca 3000 
attcaaattt gtgataaatt tggtaatcaa 3060 
actatgtcac taaggcc 3107 


<210> 11 
<211> 1680 
30 <212> DNA 

<213> Arabidopsis t ha liana 

<220> 

<221> unsure 
<222> {694} 
35 <223> n=unkown 

<400> 11 

cgcataaggt gttcgagtgt ttgcagcttg 
aaagatctga accataaaaa tgaccaatca 
aaaaatgtta tagttctcat gaatactaac 

40 catggaccta acctggcgta acggtatgtc 
ggctttaggg acacctaaaa aggcgggtcc 
acaccaacct ggaccacccc caccgcctcc 
gaagtgaagg aaatagatag cccccattga 
attggtggca tacattagct cgattttgct 

45 ctgctcggtt tcaatccaaa aactatagat 
acatacacta aagtagctaa tcatggaggt 
gaccaccagt gttgtcaatg tgatcatata 
tcggtgttat ggaaagagag tctccaatca 
atatccaatt tttgccaaat tctctatgag 

50 atagtctgca gccactagtc ctgggactgc 
ggtctcactg tctagagata agtgctccaa 
acaattaaac gaccaagtat acacttcaaa 


agaagttccg agtccaagag acctggagcc .60 
aaatccatta agccaattca aatattcact 120 
ataacaagtg aaagtaaatt taaaaatgtt 180 
tttgccttca gcagaaagta aattactgac 240 
aatgttgacg acggatttga tgtgtttggc 300 
atcaggaaga ggtgtttcta cccatttaag 360 
atgcggaacc accacaactt tcttaaaccc 420 
cttcagtcta cttaacgatt ggtcacgtac 480 
tagtccaaag ctctacaaca atatgtaatt 540 
cttatagtat atcattatca tcattctcta 600 
ggtattaata acgactaatc tgagcatacc 660 
taangaggcc atgtgaaggt tcttgccttc 720 
aactgcccaa gcaaagtagc atggtgcgaa 780 
tcggacacgg attcccggtg gatcgagacc 840 
ccagcacaat ggcctataaa tcaaattaca 900 
ctaattcaga attgagaaaa tcgaaatgct 960 
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aaccagaaaa tcatgtaaat caaaaaccgt 
gaatccatgt taaaaccata accaaaaata 
gttaaaactg atattctaaa tttagtaagt 
tttgaacata aatatatttt atagttttgt 
5 caaaacttac ggtttccatt tccaaattcg 
acgattaatt attaatttcc taaaaccatc 
aactagttgc aaaacaataa caaaacccaa 
aaatccaaag accatccata acaacaacag 
aaaccaagct taaccaatta tacatacccc 
10 gggtctataa aaatatctcg aaccgaacat 
tcggagaagc tagcacccca aagacgttta 
tcccaaagct cgagcccgcc ggttacaatc 


aacaatcaat atatatatat atattttcca 1020 
tatgaaaatt tagaaatact aaaataatat 1080 
tttaaaatgc aatgaaatcg tcattcatgt 114 0 
aggacgattt tctacttcct atatagaaat 1200 
aatgacattt aaaaacatat cccaaaaatc 1260 
catcattact tagaaaataa tattttcata 1320 
agaaccatct ccacccatta accaaaatga 1380 
tataacacta cgtaaagcca attcaagaag 1440 
taaccagacg aaccaaacca atatctgacc 1500 
aacggtctaa tgtgttacct tctaagaatc 1560 
cgaaagagtc cttcagcgca aggccgacct 1620 
cccggaacaa gaatcaccgg atgaaacgcc 1680 


<210> 12 
<211> 264 
15 <212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (39) 
20 <223> n= unknown 

<220> 

<221> unsure 
<222> (175) 
<223> n= unknown 

25 <220> 

<22l> unsure 
<222> (241) 
<223> n= unknown 


<400> 12 

30 ccaagaactc gatgattact tcaacactcc 
ttccaccaac tctcttctct catctcaatc 
cacccctggt agattcatta caaaagcttg 
ccccttatga ctttagatat ggtctagctg 
ngttcctcaa agatctaaag aatt 


tggggttgng acccgggtcc ctcactttgg 60 
ctcgtctcaa gcatatcacc ggatacatgg 120 
gctacgctga tggtgagact ctgtntggag 180 
ctgaaggtca cccttcacaa gtgggttcca 240 

264 


35 <210> 13 
<211> 273 
<212> DNA 
<213> Glycine max 

<220> 
40 <221> unsure 
<222> (12) 
<223> n=unknown 


<220> 

<22l> unsure 
45 <222> (33) 

<223> n= unknown 
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<220> 

<22l> unsure 
<222> (252) 
<223> n= unknown 

5 <220> 

<22l> unsure 
<222> (265) . . (266) 
<223> n=unknown 


<220> 
10 <221> unsure 
<222> (272) 
<223> n= unknown 

<400> 13 

ccaacatctg anaggggtag agtaggtatt 
15 ggaacaagaa gagggtctca agattgaggt 
gatgctgtca ttgctatgca catgtggggc 
aggtaacgga gggaaccaac tagaagcaag 
catctgcgat cntggtaccc tctcannaag 


tcnatctatt atccatattg tgatgaagaa 60 
tgctacactc acagttacag tagttgttgt 120 
aagcaacctc gaccctttga ttctaatacc 180 
gttgaccaat cagtacaagc cctctacttt 240 
ana 273 


<210> 14 
20 <211> 419 
<212> DNA 
<213> Glycine max 

<220> 

<221> unsure 
25 <222> (99) 

<223> n= unknown 

<220> 

<221> unsure 
<222> (346) 
30 * <223> n=unknown 

<220> 

<221> unsure 
<222> (352) 
<223> n=unknown 

35 <220> 

<22l> unsure 
<222> (392) 
<223> n=unknown 

<220> 
40. <221> unsure 
<222> (405) 
<223> n=unknown 
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<220> 

<221> unsure 
<222> (418) 
<223> n=unknown 


5 <400> 14 

gctgcatatg attggagaat agcatttcag 
cggataaaaa gcaacataga acttatggtt 
attccacatt caatgggggt cttgtacttc 
gctccaatgg gtggtggggg aggaccagat 
10 aacattggtg gaccattttt aggtgttccc 
gccaaggata ttgctgttgc caggacgata 
ccgcattcaa acccttgcaa catgtaatga 


aacactgagg tgagggatca aacactaagt 60 
gctactaang gtggaaataa ggcagttatt 120 
ctacatttta tgaaatgggt tgaagcacca 180 
tggtgctcca aatatataaa ggcagttgta 24 0 
aaggctatag cagggctatt ctcagctgag 300 
gctccaggat ttttanataa cnatctgttt 360 
anatgaaccc gttcnttggg actcaacna 419 


<210> IS 
<211> 272 
15 <212> DNA 

<213> Glycine max 

<220> 

<22l> unsure 
<222> (1) . . (272) 
20 <223> n= unknown 


<400> 15 

tganttgatc ntgngaagtn attctgtgta 
gtaagtgang ggtntgaaga gggaaagacg 
agcaacaggt tgcaggaaac aatggatcgg 
25 gccgcaggnn ggaagacaat aaacattata 
atgtttcntg tgcctgcaaa gcgatatttt 


ttanttccat gacatgaccg ttnnagatnc 60 
ctttttggtn ttngatatga ttttcgccaa 120 
ttggctgcna agttagaatc aanttataat 180 
nttcattcta tgggcggtct tttccnngan 240 
ga 272 


<210> 16 
<211> 237 
<212> DNA 
30 <213> Glycine max 

<220> 

<221> unsure 
<222> (1) . . (237) 
<223> n= unknown 


35 <400> 16 

gattttcgcc aaagcaacag gttgcaggaa 
tcaatntata atgcngcagg agggaagana 
cttttggtga aatgnttcat gtgcctgcaa 
tgggttgcaa tttgtgcgcc attccagggt 


acaatggatc ggttggctgc aaagttagaa 60 
ataaacatta taactcattc tatgggcggt 120 
agcgatattt ttgagaaata tgttaagaat 180 
gcaccaggaa ccatcaattc naccttt 237 


40 <2:10> 17 
<211> 244 
<212> DNA 
<213> Glycine max 
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<220> 

<221> unsure 
<222> (1) . . (244) 
<223> n=unknown 


5 <400> 17 

gattttcgcc aaagcaacag gttgcaggaa 
tgcaatttat aatgctgcag gagggaagaa 
tcttttggtg aaatgtttca tgtgcctgca 
ttgggttgca atttgtgcgc cattccaggg 
10 aaat 


acaatggatc ggtnggctgc aaagttagaa 60 
aataaacatt ataactcatt ctatgggcgg 120 
aagcgatatt tttgagaaat atgttaagaa 180 
tgcaccagga accatcaatt ctaccttttt 240 

244 


<210> 18 

<211> 263 

<212> DNA 

<213> Glycine max 


15 <400> 18 

gatgaaacta aaccgtgggc gactaagctt 
gttcgttgct tcatagaaga ggtcattggt 
ggaggattgg ttgcattgta ttttgcggca 
ttgcttaagc aacacctttt tgggggtttc 

20 cgaaaatatt tccatgggcc gga 


gtttactcgg ttgatttatg gcaagatcaa 60 
gaaccagtct atcttgtggg caactcacta 120 
aacaaccctc atttagtgaa aggtgtcgca 180 
tgccaaatcc cataaaaagt ccaagactag 240 

263 


<210> 19 
<211> 311 
<212> DNA 
<213> Zea mays 

25 <220> 

<221> unsure 
<222> (1) . . (311) 
<223> n=unknown 


<400> 19 

30 cggacgctgg ncatgttcgg agccccctac 
cagacgtccg aggtgtactc ccgctacttc 
agcgagagga cccggaagaa ggccgtcatc 
ctcgagttcg tccggaacac tccgccggcg 
ctggtcgcgc cgacgctccc cggcgggttc 

35 acggacatcc t 


gacttccgct acgcgccgcc gtcccccggc 60 
aaggagctga tggagctggt cgaggccgcg 120 
ctcggccaca gcttcggcgg catggtcgcg 180 
tggcggcgcg agcacatcga gcgcctcgtc 240 
ctggagccgg tgcgcaactt cgcgtccggg 300 

311 


<210> 20 
<211> 1155 
<212> DNA 
<213> 2ea mays 

40 <400> 20 

tcgacccacg cgtccggcca caagaaccct 
agagccgcac tggaagacat gggataccga 
gacttccgct acgcgccgcc gtcccccggc 
aaggagctga tggagctggt cgaggccgca 

45 ctcggccaca gcttcggcgg catggtcgcg 


ctcaagtcag actggtgcct cggaaagctg 60 
gacggagaca ccatgttcgg agccccctac 120 
cagacgtccg aggtgtactc ccgctacttc 180 
agcgagagga cccggaagaa ggccgtcatc 240 
ctcgagttcg tccggaacac tccgccggcg 300 


ffeOOClO: <WO 01 1630QA2_I_> 
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tggcggcgcg agcacatcga gcgcctcgtc 

ctggagccgg tgcgcaactt cgcgtccggg 

ccgctggcca cgcgagccat gtgragragc 

ccggccgtgt tcgggcgcct gcaggcgccg 

5 gcgtccgcgc acgacatgga gcgcttcctc 

cccttcagga gacgggccgt ccccaagatg 

acgtacatca gcggggtcgg caacaggacg 

ttcgacgcgg ccccggaggt ggcggcgtac 

agcgtcttgg cgtttgagaa ggagatgcgt 

10 tccatcaaga tcgataaggc ccagcattct 

agggtcattc aagaaattgt tgaggccaat 

gtcatgtatg cattaccgag ctgtgggggc 

tccgtgctta aaacggtcgt ggtgtggtct 

caattgcttg cctcaaaaaa aaaaaaaaaa 

15 aaaaaaaaar gggcg 


PCT7US00/23863 



ctggtcgcgc cgacgctccc cggcgggttc 360 
acggacatcc tmtacgtgcc agcgacgacg 420 
ttcgagagcg ccatcgtgaa cttcccgtcg 480 
ctcgtggtca ccagggagcg gaactactcc 54 0 
gccgccgtcg gctccggcga ggccgcggag 600 
ggcagcttcg cggcgccgat ggtgcccatg 660 
ccgctgcggc tggtgttctg gggcgacgac 720 
ggggacggag atggcaagat caatttgatc 780 
cggcagccgg agcagaagaa gcagttcaaa 840 
acgatcgtca cggatgattt tgccctgcac 900 
aatcagaaga ttccatccta aattcttcat 960 
caatagtggg ttgggagttg ggacatcggt 1020 
caattcaatc gattagttat ttgttaacgt 1080 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 114 0 

1155 


<210> 21 
<211> 328 
<212> DNA 
<213> Zea mays 

20 <400> 21 

gttggaatgc tcttcaactt tcttacgctc 

ctggcttaaa tgcatgctgc ttgaacccta 

aaggcctgat agtggtcttc tgcaattaca 

ctctcttcag tatggaaaga atgggtcaaa 

25 gcaattatcg ctgttccgta tgattggaga 

ctgtactttc acaattaaac aggatcag 


atttacattg cttttctctg cggtaaattg 60 
taatcagata gaccatcccg aatgcaagtc 120 
gagctggacc ctggttatat aacaggtcct 180 
tggtgtgtag agtttggcat tgaagctaat 240 
ctgcccccat caatgcttga ggagagagat 300 

328 


<210> 22 
<211> 356 
<212> DNA 
30 <213> Zea mays 

<400> 22 

gtctttctgc aattacagag ctggaccctg 
tcagtatgga aagaatgggt caaatggtgt 
atcgctgttc cgtatgattg gagactgccc 
35 tttcacaaat taaagtttgt aacacttgcc 
catctgttag gatcagtaag agttaatggc 
cagatcccac aagcatacgg ttaccgccaa 


gttatataac aggtttcagg tcctctctct 60 
gtagagtttg gcattgaagc taatgcaatt 120 
ccatcaatgc ttgaggagag agatctgtac 180 
tcaacfetgtt atgaagcaac caatgctata 240 
ccatgacgga ttcaggttcc tgctcaccaa 300 
tgcctgcagt tggacagtac caaccc 356 


<210> 23 
<211> 1552 
40 <212> DNA 

<213> Zea mays 

<400> 23 

tcgacccacg cgtccgcaga catgatcatt 
atgatagtgg aaatgattaa atggggatat 
45 tatgatttcc gtcaaagcaa caggctctca 
gagtcagtgt acacagcttc tggtggaaag 
ggattacttg tgaaatgttt catctcactg 


ggtgatgaca ctgtgtacta ctatcatgac 60 
caagaaggaa aaactctctt tggatttggt 120 
gagacacttg acagattttc taaaaagctg 180 
aagatcaatc tcattactca ttcaatgggg 240 
cacagtgata tatttgaaaa atatgtcaar 300 
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agttggatcg caattgctgc accattccar 
ctgaatggaa tgtcttttgt craaggatgg 
atgcascaat tgctacttga gtgcccatca 
ccagtggaga gacatcccac tgctacagat 
5 gaaaagtgcc ctgttagagt cgtatgagcc 
tctttccagt aatgagatca ttgctgatgg 
tatattgaat tgggcaaaga aacttatgat 
gtgaaattct acaacattta tgggattgat 
agtgaacagc agccggtttc aagtcttagt 

10 tatgttgatg gcgacggatc tgttcccgca 
gtggcaaggg ttggggttgc tcctgaccac 
cggatcgtcc agcactggct gcacgccgga 
gactatgtca tactcccaac acgcttacga 
tgtcacgtca gtagcggagg actgggagat 

15 gccaggcgag cttcctccta* tggtcagcac 
agcactggaa gaggcgcatg ccaccgtggt 
tgtgcaagtt agggctgtgg gtgtcagcca 
ttgtctactc tatctagcag tagcagctat 
tacatatatg gctatgacct ctgtagggat 

20 cctgcttgta aatgttcaga accgaaaaca 
aaaaataaaa agatggtaaa ggattccatt 


gg.tgcccctg ggtamataac taccaktytg 360 
gaaycaagat tctttatttc caaawkgkgt 420 
atctatgagk tgctgscaam ccctaacttt 480 
ttggagagag aatttggata mcagtggcaa 540 
tgaggaagca ataaagatga ttaaagaggc 600 
catgcatatt ccggtgcccc ttaatttgga 660 
cttttatgca gtacaaagct tccggaatca 720 
tatgatactc cacatactgt ctgctatggc 780 
agcctcttat atgctcaggg aaaatacgtc 840 
gaatcagcaa aggctgacgg atttaatgca 900 
cggggaatcg tgtgcagtcg ccgcgcgttc 960 
gaacctgacc cgttctacga cccgctgagc 1020 
aatcgagaag catcgtgaga aacacgggga 1080 
catctcccct aacgacggca agaccatrrg 1140 
actgaccacg agccgggaag gcaaggaggg 1200 
cgttcacccg gagaagaagg gacggcagca 1260 
tggtggctaa agccgtagga gccacgttgg 1320 
acctctgtgc acgcactgta aaattggatg 1380 
ctggttttag aagtataaat gggcaccctg 1440 
caggccctgt tctttttttt cctttttaaa 1500 
aaaaaaaaaa aaaaaaaagg eg 1552 


<210> 24 
<211> 227 
<212> DNA 
25 <213> 2ea mays 

<220> 

<221> unsure 
<222> (1) . . (227) 
<223> n=unknovm 

30 <400> 24 

ttggttatga tttccgtcaa agcaacaggc 
agctggagtc agtgtacaca gcttctggtg 
tggggggatt acttgtgaaa tgtntcatct 
tcaagagttg gntegcaatt gcngcaccat 


tctcagagac acttgacaga ttttctaaaa 60 
gaaagaagat caatctcatt actcattcaa 120 
cactgcacag tgatatatnt gaaaaatatg 180 
tccaaggtgc ccctggg 227 


35 <210> 25 

<211> 1587 
<212> DNA 
<213> Zea mays 

<220> 

40 <221> unsure 

<222> (1) . . (1587) 

<223> n=unknown 

<400> 25 

ggagattgtc gtgeeggagg acgaccacgg cctgtttgcc atcgacattc ttgatccttc 60 
45 ctggtttgta gaactcgacc cacgcgtccg cccaccgtcc gggagattgt cgtgccggag : 120 
gacgaccacg gcctgtttgc catcgacatt cttgatcctt cctggtttgt agaacttctc 180 
catctgtcta tggtgtatca cttccatgat atgattgata tgctcataaa ctgtggatat 240 
gagaaaggga ccacactatt tggatatggt tatgattttc gecaaagcaa caggatagac 300 
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aaagcgatgg ctggtttgag agcaaaactt 
aaagttaatt taatctcaca ttctatgggt 
aatcatgatg tattcactaa gtatgtcaac 
ggtgcccccg gatgcatcaa tgattctcta 
5 gagagcttct ttttcgtatc tagatgggca 
atctatgaaa tgttaccaaa tccagaattc 
tggcgtaaga accctgaaaa ggatggaact 
tgtgtgtcct tgttcgaaga agctttaagg 
gtagcactac cattcaatat gtcagtcttc 

10 gacaatgctg aattaccaga tactgtgagc 
actccatacg atgtatgcta tggctcagaa 
tgtcacacag tgccggcata cacttatgtg 
gcacgggctg atgggttctc tgcgaaagaa 
ctgctgtccg atgagaacgt attcaagctt 

15 aagtcagagt ggcgttgcgt gtctaaatcc 
tagttcttca ggaagactgt tattttggcc 
tctggtgatt gtattgtatg tctgcacgat 
gattagtgcc cttgtacgat gtatcattgg 
fcttntttgnc cttccactct ggacntagta 

20 nngtctgnrm nnnnnnnnnn nnnnnnannn 
nnnnnnnnnn rmaaaaaaaa awgkgaagcc 
aaaaaaaaaa aaaaaaaaaa aaaaaaa 


<210> 26 
<211> 300 
25 <212> DNA 

<213> Zea mays 


gagacagctc ataagacctc tggagggaaa 360 
ggattgctag tacgctgctt catgtctatg 420 
aaatggattt gcattgcttg tccattccaa 480 
cttactggat tgcaatttgt ttatggtttt 540 
atgcaccaat tgcttgtcga atgcccatca 600 
aagtggaagg aaaaaccaat tattcaggtt 660 
gtggagcttg ttcaatatga agcaactgat 720 
aataatgagc tcacgtataa cggaaagaaa 780 
aaatgggcca ccaagactcg ccaaatccta 84 0 
ttttacaata tatacgggac atcttatgaa 900 
agctctccga ttggagattt gtcagaagtg 960 
gatggagatt gcacggttcc catagaatcg 1020 
agagttggcg tcaaggcgga ccaccgtggc 1080 
ctcaagaaat ggctcggtgt gagcgagaag 1140 
tactccaaag tgacctaatt gggttgcctg 1200 
tttcctcctg aagagaagat gaaacaaaat 1260 
gtaaatctct gcaagctgca cggaacaagg 1320 
caggcatttn tttttgaacc tangggcata 1380 
aagaatatnt gaatcgacct tanttrmaan 1440 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500 
gntnntnntt tnaaaagnnt tttnnnaaaa 1560 

1587 


<220> 

<221> unsure 
<222> (1) . . (300) 
30 <223> n= unknown 


<400> 26 

gacaaagcga tggctggttt gagagcaaaa 
aaaaaagtta atttaatctc acattctatg 
atgaatcatg atgtgagttt tcatgttttc 
35 atgtcaattt cccccatttt ctaggtattc 
gcttgtccat tccaaggtaa cttatgggac 


cttgagacag ctcataagac ctctggaggg 60 
ggtggattgc tagtacgctg cttcatgtct 120 
tgtgtttttt ttgcttttgc ataaatatcc 180 
actangtatg tcaacaaatg gatttgcatt 240 
atttcaattg tt tat tana t natggggncc 300 


<210> 27 
<211> 1240 
<212> DNA 
40 <213> Zea mays 

<220> 

<221> unsure 

<222> (1)..{1240) 

<223> n^unknown 

45 <400> 27 

tcgacccacg ' cgtccggttc 
tgccatattt gtaacacaga 
aagascaata ttgaactcat 


ccagttccca ccgtgtagat 
tt acta tat a caggttcgtg 
agwagsgaca aatggtggaa 


ggttctggta taaaatgtat 60 
atcaaacttt gagcagaata 120 
atagggtggt ggkmgatccc 180. 
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acnactccat ggggtcnttn attttntgcn 
tgggggcagt gggtccgaac tggntgtaga 
atctttctta ggagttccta aggctgttgc 
tgccggttgc taggtataag taatgattca 
5 aatgaatatt attggatgct cgacaagctt 
gacctcacaa gtccagggag tgcctgcctc 
atgcaccgac ggtaccgtga ggtcatttct 
ggcaaggtca ggactgacaa gacctaccct 
aagacaaacg agaactacag gctgctttcg 

10 gttcaagctc tacaaggtga ggtctgttca 
cacctacgac gaccgcacca tccgctaccc 
caagatcgat ctggagacca acaagatcat 
ggtcatggtg atcggcagga ggaataccgg 
gcataagggc aacttcgaga ccatccacgt 

15 ttctcctatt tgttgtacag gaaaacatag 
tgtggagact tgatttcata taaagttagg 
gtttagattt agagtacact atgtatgcgt 
aatggtactt atttatatta attaattaaa 


ttttacgnaa tggntcgaag ccctcctccg 240 
accatataaa gctgtaatga atattggagg 300 
tgggcttttt ttcttctaag caaaagatgt 360 
tttatttaaa gcaaaaggga atagcaaaag 420 
gcggagcttt tgctcccaag ccatcttctg 480 
tgatcctcat catcaggaac aggctcaagt 540 
atcctgatgc aacaccatgt acttgttgat 600 
gctgggttca tggatgtcat ttccatccct 660 
tcttcaccca atcagggatg aggatgccaa 720 
gtttggccag aaagacatcc cctatctgaa 780 
cgacccgctc atcaaggcca acgacaccat 840 
ggacttcatc atgtttgacg tcggcaacgt 900 
gcgtgtagga gtgatcaara taagggagaa 960 
gctgcttgra gctttttgct atgtctagtt 1020 
aatgaaattc aaatttggtg gccacaaaag 1080 
cttaacatta gtgcaaacag ttgtatttta 1140 
tgtttgacaa tgcttattta tgatatattg 1200 
aaaaaaaaaa 124 0 


<210> 28 
20 <211> 324 
<212> DNA 
<213> Zea mays 

<400> 28 

cgaatgctcc tgacatggaa atattttcca 
25 catatgtcta taagttggcc ccacaggcag 
cggctgaagg cggggaggaa aatagctgct 
atgaaactgt tccagttctt agtgcgggct 
ctcgtttcaa ccctgccggc agcaagactt 
ctactctcct ggaaggcagg ggca 


tgtacggagt aggcattcct actgaaaggg 60 
aatgttatat acctttccga attgacacct 120 
tgaaaggggg tgtttactta gccgatggtg 180 
acatgtgtgc aaaaggatgg cgtggcaaaa 240 
acgtgagaga atacagccat tcaccaccct 300 

324 


30 <210> 29 
<211> 254 
<212> DNA 
<213> Zea mays 


<400> 29 

35 gaataaagag caacattgaa ctcatggtag 
tcccacactc catgggggtc ctctattttt 
ctcccatggg gggtggcggt ggtccagact 
atattggagg acctttctta ggagttccta 
ccaaagatgt tgcc 


caacaaatgg tggaaatagg gtggtggtga 60 
tgcattttat gaaatgggtc gaagcacctc 120 
ggtgtgagaa gcatattaaa gctgtaatga 180 
aggctgttgc tggccttttc tcatctgaag 240 

254 


40 <210> 30 
<211> 518 
<212> DNA 
<213> Mus mus cuius 


<400> 30 

45 tggaggacaa cgcggggtct gatacgactc actataggga atttggccct cgagcagtag 60 

attcggcacg atgggcacga ggactccatc atgttcctca agctttattc ctaccgggat 120 

gtcaacctgt ggtgccgcca gcgaagggtc aaggccaaag ctgtctctac agggaagaag 180 
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gtcagtgggg ctgctgcgag caagctgtga gctatccaga caacctgacc taccgagatc 240 

tcgattactt catctttgct cctactttgt. gttatgaact caactttcct cggtcccccc 300 

gaatacgaga gcgctttctg ctacgacgag ttcttgagat gctctttttt acccagcttc 360 

aagtggggct gatccaacag tggatggtcc ctactatcca gaactccatg gaagcccttt 420 

caagagcttc tgcagttttg gagaccgcga gttctacaga gattggtgga atgctgagtc 480 

tgtcaccgac ttttggcaga actggaatat ccccgtgg 5x8 


<210> 31 
<21X> 299 
<212> DNA 
10 <213> Mus musculus 

<400> 31 

ccatgatggc tcaggtccca ctggcctgga ttgtgggccg attcttccaa gggaactatg 60 

gcaatgcagc tgtgtgggtg acactcatca ttgggcaacc ggtggctgtc tcatgtatgt 120 

ccacgactac tacgtgctca actacgatgc cccagtgggt catgagctac tgccaaaggc 180 

15 agccctccct aacctgggcc tggagttctg gaggggttcc tggctgcctg cacactcctc 240 

ctagtctggg aggcctctct gcccctatgc gctactcctg ctcttgggga tggcatttg 299 


<210> 32 
<211> 1895 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequencer Inferred cDNA 
sequence 

<220> 
25 <221> unsure 

<222> (1) . . (1895) 
<223> n= unknown 

<400> 32 

gtctggtgtg atggggacag ggagggactt ccccttaccc agcactggtg ttggctgagg 60 
tgggtgctga gtctcagagc ttggcatgga gaccagacag ggctgggtct gcaagcctga 120 
ggctgccgcc ctgagctcgg gctgggacgt gcccagaggt gttgggagga tctggggtga 180 
gtaccctgtg gccaggacta aaggggctnc accctcctgt ccatccctcg cagatcttga 240 
gcaatgcccg gttatttctg gagaacctca tcaagtatgg catcctggtg gaccccatcc 300 
aggtggtttc tctgttcctg aaggatccct atagctggcc cgccccatgc ctggttattg 360 
cggccaatgt ctttgctgtg gctgcattcc aggttgagaa gcgcctggcg gtgggtgccc 420 
tgacggagca ggcgggactg ctgctgcacg tggccaacct ggccaccatt ctgtgtttcc 480 
cagcggctgt ggtcttactg gttgagtcta tcactccagt gggctccctg ctggcgctga 540 
tggcgcacac catcctcttc ctcaagctct tctcctaccg cgacgtcaac tcatggtgcc .600 
gcagggccag ggccaaggct gcctctgcag ggaagaaggc cagcagtgct gctgccccgc 660 
acaccgtgag ctacccggac aatctgacct accgcgatct ctactacttc ctcttcgccc 720 
ccaccttgtg ctacgagctc aactttcccc gctctccccg catccggaag cgctttctgc 780 
tgcgacggat ccttgagatg ctgttcttca cccagctcca ggtggggctg atccagcagt 840. 
99 a tggtccc caccatccag aactccatga agcccttcaa ggacatggac tactcacgca 900 
tcatcgagcg cctcctgaag ctggcggtcc ccaatcacct catctggctc atcttcttct 960 
actggctctt ccactcctgc ctgaatgccg tggctgagct catgcagttt ggagaccggg' 1020 
agttctaccg ggactggtgg aactccgagt ctgtcaccta cttctggcag aactggaaca 1080 
tccctgtgca caagtggtgc atcagacact tctacaagcc catgcttcga cggggcagca 1140 
gcaagtggat ggccaggaca ggggtgttcc tggcctcggc cttcttccac gagtacctgg 1200 


35 


40 
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tgagcgtccc tctgcgaatg ttccgcctct gggcgttcac gggcatgatg gctcagatcc 1260 
cactggcctg gttcgtgggc cgctttttcc agggcaacta tggcaacgca gctgtgtggc 1320 
tgtcgctcat catcggacag ccaatagccg tcctcatgta cgtccacgac tactacgtgc 1380 
tcaactatga ggccccagcg gcagaggcct gagctgcacc tgagggcctg gcttctcact 1440 
5 gccacctcac acccgctgcc agagcccacc tctcctccta ggcctcgagt gctggggatg 1500 
ggcctggctg cacagcatcc tcctctggtc ccagggaggc ctctctgccc ctatggggct 1560 
ctgtcctgca cccctcaggg atggcgacag caggccagac acagtctgat gccagctggg 1620 
agtcttgctg accctgcccc gggtccgagg gtgtcaataa agtgctgtcc agtgacctct .1680 
tcagcctgcc aggggcctgg ggcctggtgg ggggtatggc cacacccaca agggcgagtg 1740 
10 ccagagctgt gtggacagct gtcccaggac ctgccgggga gcagcagctc cactgcagca 1800 
999 c 999cat ggccggtagg gggagtgcaa ggccaggcag acgcccccat tccccacact 1860 
cccctaccta gaaaagctca gctcaggcgt cctct 1895 


<210> 33 

<211> 1766 

15 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Inferred cDNA 
sequence 

20 <400> 33 

cacgactggg ccgcgacgtg gtgcgggccg aagccatggg cgaccgcgga ggcgcgggaa 60 
gctctcggcg tcggaggacc ggctcgcggg tttccatcca gggtggtagt gggcccatgg 120 
tagacgaaga ggaggtgcga gacgccgctg tgggccccga cttgggcgcc gggggtgacg 180 
ctccggctcc ggctccggtt ccggctccag cccacacccg ggacaaagac cggcagacca 240 

25 gcgtgggcga cggccactgg gagctgaggt gccatcgtct gcaagactct ttgttcagct 300 
cagacagcgg tttcagcaat taccgtggta tcctgaattg gtgcgtggtg atgctgatcc 360 
tgagtaatgc aaggttattt ttagagaatc ttatcaagta tggcatcctg gtggatccca 420 
tccaggtggt gtctctgttt ctgaaggacc cctacagctg gcctgcccca tgcttgatca 480 
ttgcatccaa tatctttatt gtggctacat ttcagattga gaagcgcctg tcagtgggtg 540 

30 ccctgacaga gcagatgggg ctgctgctac atgtggttaa cctggccaca attatctgct 600 
tcccagcagc tgtggcctta ctggttgagt ctatcactcc agtgggttcc ctgtttgctc 660 
tggcatcata ctccatcatc ttcctcaagc ttttctccta ccgggatgtc aatctgtggt 720 
gccgccagcg aagggtcaag gccaaagctg tgtctgcagg gaagaaggtc agtggggctg 780 
ctgcccagaa cactgtaagc tatccggaca acctgaccta ccgagatctc tattacttca 840 

35 tctttgctcc tactttgtgt tatgaactca actttcctcg atccccccga atacgaaagc 900 
gctttctgct acggcgggtt cttgagatgc tctttttcac ccagcttcaa gtggggctga 960 
tccagcagtg gatggtccct actatccaga actccatgaa gcccttcaag gacatggact 1020 
attcacgaat cattgagcgt ctcttaaagc tggcggtccc caaccatctg atatggctca 1080 
tcttcttcta ttggcttttc cactcatgtc tcaatgctgt ggcagagctc ctgcagtttg 1140 

40 gagaccgcga gttctacagg gactggtgga atgctgagtc tgtcacctac ttttggcaga 1200 
actggaatat ccccgtgcac aagtggtgca tcagacactt ctacaagcct atgctcagac 1260 
tgggcagcaa caaatggatg gccaggactg gggtcttttt ggcgtcagcc ttcttccatg 1320 
agtacctagt gagcattccc ctgaggatgt tccgcctctg ggcattcaca gccatgatgg 1380 
ctcaggtccc actggcctgg attgtgaacc gcttcttcca agggaactat ggcaatgcag 1440 

45 ctgtgtgggt gacactcatc attgggcaac cggtggctgt gctcatgtat gtccacgact 1500 
actacgtgct caactatgat gccccagtgg gggcctgagq tactgccaaa ggccagccct 1560 
ccctaacctg ggcctggagt tctggagggc ttcctggctg cctgcacact cctcctagtc 1620 
tgggaggcct ctctgcccct atggggccta ctcctgctct tggggatggc acctgagtcc 1680 
agctggtatg agccagtgct gggagtctgt gctgaccagg ggctgaggat atcaataaag 1740 

50 agctatctaa aaaaaaaaaa aaaaaa 1766 
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<210> 34 
<211> 409 
<212> PRT 

<213> Homo sapiens 
5 <400> 34 

Arg Arg Ser Leu Leu Asp -Glu Leu Leu Glu Val Asp His lie Arg Thr 
1 5 V>;5~ io 15 

He Tyr His Met Phe He Ala Leu Leu He Leu Phe He Leu Ser Thr 
20 25 30 

10 Leu Val Val Asp Tyr He Asp Glu Gly Arg Leu Val Leu Glu Phe Ser 
35 40 45 

Leu Leu Ser Tyr Ala Phe Gly Lys Phe Pro Thr Val Val Trp Thr Trp 
50 55 60 

Trp He Met Phe Leu Ser Thr Phe Ser Val Pro Tyr Phe Leu Phe Gin 
15 65 70 75 80 

His Trp Arg Thr Gly Tyr Ser Lys Ser Ser His Pro Leu He Arg Ser 
85 90 95 

Leu Phe His Gly Phe Leu Phe Met He Phe Gin He Gly Val Leu Gly 
100 105 no 

20 Phe Gly Pro Thr Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser 
115 120 125 

Arg Phe He He He Phe Glu Gin lie Arg Phe Val Met Lys Ala His 
130 135 - 140 

Ser Phe Val Arg Glu Asn Val Pro Arg Val Leu Asn Ser Ala Lys Glu 
25 145 150 155 160 

Lys Ser Ser Thr Val Pro He Pro Thr Val Asn Gin Tyr Leu Tyr Phe 
165 170 175 

Leu Phe Ala Pro Thr Leu He Tyr Arg Asp Ser Tyr Pro Arg Asn Pro 
180 185 i.90 

30 Thr Val Arg Trp Gly Tyr Val Ala Met Lys Phe Ala Gin Val Phe Gly 
195 200 205 

Cys Phe Phe Tyr Val Tyr Tyr lie Phe Glu Arg Leu Cys Ala Pro Leu 
210 215 220 

Phe Arg Asn He Lys Gin Glu Pro Phe Ser Ala Arg Val Leu Val Leu 
35 225 230 235 240 

Cys Val Phe Asn Ser lie Leu Pro Gly Val Leu lie Leu Phe Leu Thr 
245 250 255 

Phe Phe Ala Phe Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu 
260 265 270 
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Arg Phe Gly Asp Arg Met Phe Tyr Lye Asp Trp Trp Asn Ser Thr Ser 
275 280 285 

Tyr Ser Asn Tyr Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu 
290 295 300 

5 Tyr Tyr Tyr Ala Tyr Lys Asp Phe Leu Trp Phe Phe Ser Lys Arg Phe 
305 310 315 320 

Lys Ser Ala Ala Met Leu Ala Val Phe Ala Val Ser Ala Val Val His 
325 330 335 

Glu Tyr Ala Leu Ala Val Cys Leu Ser Phe Phe Tyr Pro Val Leu Phe 
10 340 345 350 

Val Leu Phe Met Phe Phe Gly Met Ala Phe Asn Phe He Val Asn Asp 
355 360 365 

Ser Arg Lys Lys Pro He Trp Asn Val Leu Met Trp Thr Ser Leu Phe 
370 375 380 

15 Leu Gly Asn Gly Val Leu Leu Cys Phe Tyr Ser Gin Glu Trp Tyr Ala 
385 390 395 400 

Arg Arg His Cys Pro Leu Lys Asn Pro 
405 


<210> 35 
20 <211> 409 
<212> PRT 
<213> Mas musculus 

<400> 35 

Arg Gin Ser Leu Leu Asp Glu Leu Phe Glu Val Asp His He Arg Thr 
25 1 5 10 15 

He Tyr His Met Phe He Ala Leu Leu He Leu Phe Val Leu Ser Thr 
20 25 30 

lie Val Val Asp Tyr He Asp Glu Gly Arg Leu Val Leu Glu Phe Asn 
35 40 45 

30 Leu Leu Ala Tyr Ala Phe Gly Lys Phe Pro Thr Val He Trp Thr Trp 
50 55 60 

Trp Ala Met Phe Leu Ser Thr Leu Ser He Pro Tyr Phe Leu Phe Gin 
65 70 75 80 

Pro Trp Ala His Gly Tyr Ser Lys Ser Ser His Pro Leu He Tyr Ser 
3S 85 90 95 

Leu Val His Gly Leu Leu Phe Leu Val Phe Gin Leu Gly Val Leu Gly 
100 105 no 
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Phe Val Pro Thr Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser 
115 120 125 

Arg Phe lie Leu He Leu Glu Gin He Arg Leu He Met Lys Ala His 
130 135 140 

5 Ser Phe Val Arg Glu Asn He Pro Arg Val Leu Asn Ala Ala Lys Glu 
145 150 155 160 

Lys Ser Ser Lys Asp Pro Leu Pro Thr Val Asn Gin Tyr Leu Tyr Phe 
165 170 175 

Leu Phe Ala Pro Thr Leu He Tyr Arg Asp Asn Tyr Pro Arg Thr Pro 
10 180 1S5 190 

Thr Val Arg Trp Gly Tyr Val Ala Met Gin Phe Leu Gin Val Phe Gly 
195 200 205 

Cys Leu Phe Tyr Val Tyr Tyr He Phe Glu Arg Leu Cys Ala Pro Leu 
210 215 220 

15 Phe Arg Asn He Lys Gin Glu Pro Phe Ser Ala Arg Val Leu Val Leu 
225 230 235 240 

Cys Val Phe Asn Ser He Leu Pro Gly Val Leu He Leu Phe Leu Ser 
245 250 255 

Phe Phe Ala Phe Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu 
20 260 265 270 

Arg Phe Gly Asp Arg Met Phe Tyr Lys Asp Trp Trp Asn Ser Thr Ser 
275 280 285 

Tyr Ser Asn Tyr Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu 
290 295 300 

25 Tyr Tyr Tyr Val Tyr Lys Asp Leu Leu Trp Phe Phe Ser Lys Arg Phe 
305 310 315 320 

Lys Ser Ala Ala Met Leu Ala Val Phe Ala Leu Ser Ala Val Val His 
325 330 335 

Glu Tyr Ala Leu Ala lie Cys Leu Ser Tyr Phe Tyr Pro Val Leu Phe 
30 340 345 350 

. Val Leu Phe Met Phe Phe Gly Met Ala Phe Asn Phe He Val Asn Asp 
355 360 365 

Ser Arg Lys Arg Pro He Trp Asn lie Met Val Trp Ala Ser Leu Phe 
370 375 380 

35 Leu Gly Tyr Gly Leu He Leu Cys Phe Tyr Ser Gin Glu Trp Tyr Ala 
385 390 395 400 

Arg Gin His Cys Pro Leu Lys Asn Pro 
405 
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<210x 36 
<211> 429 
<212> PRT 

<213> Saccharomyces cerevisiae 
5 <400> 36 

Asp Lys Ala Asp Ala Pro Pro Gly Glu Lys Leu Glu Ser Asn Phe Ser 
1 5 io 15 

Gly lie Tyr Val Phe Ala Trp Met Phe Leu Gly Trp He Ala He Arg 
20 25 30 

10 Cys Cys Thr Asp Tyr Tyr Ala Ser Tyr Gly Ser Ala Trp Asn Lys Leu 
35 40 45 

Glu He Val Gin Tyr Met Thr Thr Asp Leu Phe Thr He Ala Met Leu 
50 55 60 

Asp Leu Ala Met Phe Leu Cys Thr Phe Phe Val Val Phe Val His Trp 
15 65 70 75 60 

Leu Val Lys Lys Arg He He Asn Trp Lys Trp Thr Gly Phe Val Ala 
85 90 95 

Val Ser He Phe Glu Leu Ala Phe lie Pro Val Thr Phe Pro He Tyr 
100 los no 

20 Val Tyr Tyr Phe Asp Phe Asn Trp Val Thr Arg He Phe Leu Phe Leu 
115 120 125 

His Ser Val Val Phe Val Met Lys Ser His Ser -Phe Ala Phe Tyr Asn 
13 ° 135 140 

Gly Tyr Leu Trp Asp lie Lys Gin Glu Leu Glu Tyr Ser Ser Lys Gin 
25 145 150 155 160 

Leu Gin Lys Tyr Lys Glu Ser Leu Ser Pro Glu Thr Arg Glu He Leu 
165 170 175 

Gin Lys Ser Cys Asp Phe Cys Leu Phe Glu Leu Asn Tyr Gin Thr Lys 
180 185 190 

30 Asp Asn Asp Phe Pro Asn Asn lie Ser Cys Ser Asn Phe Phe Met Phe 
195 200 205 

Cys Leu Phe Pro Val Leu Val Tyr Gin lie Asn Tyr Pro Arg Thr Ser 
210 215 220 

Arg lie Arg Trp Arg Tyr Val Leu Glu Lys Val Cys Ala He lie Gly 
35 225 230 235 240 

Thr lie Phe Leu Met Met Val Thr Ala Gin Phe Phe Met His Pro Val 
245 250 255 

Ala Met Arg Cys lie Gin Phe His Asn Thr Pro Thr Phe Gly Gly Trp 
260 265 270 
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He Pro Ala Thr Gin Glu Trp Phe His Leu Leu Phe Asp Met He Pro 
275 280 285 

Gly Phe Thr Val Leu Tyr Met Leu Thr Phe Tyr Met He Trp Asp Ala 
290 295 300 

5 Leu Leu Asn Cys Val Ala Glu Leu Thr Arg Phe Ala Asp Arg Tyr Phe 
305 310 315 320 

Tyr Gly Asp Trp Trp Asn Cys Val Ser Phe Glu Glu Phe Ser Arg He 
325 330 335 

Trp Asn Val Pro Val His Lys Phe Leu Leu Arg His Val Tyr His Ser 
340 345 350 

Ser Met Gly Ala Leu His Leu Ser Lys Ser Gin Ala Thr Leu Phe Thr 
355 360 365 

Phe Phe Leu Ser Ala Val Phe His Glu Met Ala Met Phe Ala He Phe 
370 375 380 

15 Arg Arg Val Arg Gly Tyr Leu Phe Met Phe Gin Leu Ser Gin Phe Val 
385 390 395 400 

Trp Thr Ala Leu Ser Asn Thr Lys Phe Leu Arg Ala Arg Pro Gin Leu 
405 410 415 

Ser Asn Val Val Phe Ser Phe Gly Val Cys Ser Gly Pro 
20 420 425 


<210> 37 
<211> 432 
<212> PRT 

<213> Saccharomyces cerevisiae 
25 <400> 37 

Glu Thr Val Val Thr Val Glu Thr Thr He He Ser Ser Asn Phe Ser 
1 5 10 15 

Gly Leu Tyr Val Ala Phe Trp Met Ala lie Ala Phe Gly Ala Val Lys 
20 25 30 

30 Ala Leu He Asp Tyr Tyr Tyr Gin His Asn Gly Ser Phe Lys Asp Ser 
35 40 45 

Glu He Leu Lys Phe Met Thr Thr Asn Leu Phe Thr Val Ala Ser Val 
50 55 60 

Asp Leu Leu Met Tyr Leu Ser Thr Tyr Phe Val Val Gly He Gin Tyr 
35 65 70 75 80 

Leu Cys Lys Trp Gly Val Leu Lys Trp Gly Thr Thr Gly Trp He Phe 
85 90 95 
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Gin Met Pro Leu Val Ala Leu Thr Asn Thr Lys Phe Met Arg Asn Arg 
405 410 415 

Thr lie He Gly Asn Val He Phe Trp Leu Gly He Cys Met Gly Pro 
420 425 430 

5 


<210> 38 
<211> 1942 
<212> DNA 

<213> Arabidopsis thaliana 
10 <400> 38 

ctctcgtgaa tcctttttcc tttcttcttc ttcttctctt cagagaaaac tttgcttctc 60 
tttctataag gaaccagaca cgaatcccat tcccaccgat ttcttagctt cttccttcaa 120 
tccgctcttt ccctctccat tagattctgt ttcctctttc aatttcttct gcatgcttct 180 
cgattctctc tgacgcctct tttctcccga cgctgtttcg tcaaacgctt ttcgaaatgg 240 

15 cgattttgga ttctgctggc gttactacgg tgacggagaa cggtggcgga gagttcgtcg 300 
atcttgatag gcttcgtcga cggaaatcga gatcggattc ttctaacgga cttcttctct 360 
ctggttccga taataattct ccttcggatg atgttggagc tcccgccgac gttagggatc 420 
ggattgattc cgttgttaac gatgacgctc agggaacagc caatttggcc ggagataata 480 
acggtggtgg cgataataac ggtggtggaa gaggcggcgg agaaggaaga ggaaacgccg 540 

20 atgctacgtt tacgtatcga ccgtcggttc cagctcatcg gagggcgaga gagagtccac 600 
ttagctccga cgcaatcttc aaacagagcc atgccggatt attcaacctc tgtgtagtag 660 
ttcttattgc tgtaaacagt agactcatca tcgaaaatct tatgaagtat ggttggttga 720 
tcagaacgga tttctggttt agttcaagat cgctgcgaga ttggccgctt ttcatgtgtt 780 
gtatatccct ttcgatcttt cctttggctg cctttacggt tgagaaattg gtacttcaga 840 

25 aatacatatc agaacctgtt gtcatctttc ttcatattat tatcaccatg acagaggttt 900 
tgtatccagt ttacgtcacc ctaaggtgtg attctgcttt tttatcaggt gtcactttga 960 
tgctcctcac ttgcattgtg tggctaaagt tggtttctta tgctcatact agctatgaca 1020 
taagatccct agccaatgca gctgataagg ccaatcctga agtctcctac tacgttagct 1080 
tgaagagctt ggcatatttc atggtcgctc ccacattgtg ttatcagcca agttatccac 1140 

30 gttctgcatg tatacggaag ggttgggtgg ctcgtcaatt tgcaaaactg gtcatattca 1200 
ccggattcat gggatttata atagaacaat atataaatcc tattgtcagg aactcaaagc 1260 
atcctttgaa aggcgatctt ctatatgcta ttgaaagagt gttgaagctt tcagttccaa 1320 
atttatatgt gtggctctgc atgttctact gcttcttcca cctttggtta aacatattgg 1380 
cagagcttct ctgcttcggg gatcgtgaat tctacaaaga ttggtggaat gcaaaaagtg 1440 

35 tgggagatta ctggagaatg tggaatatgc ctgttcataa atggatggtt cgacatatat 1500 
acttcccgtg cttgcgcagc aagataccaa agacactcgc cattatcatt gctttcctag 1560 
tctctgcagt ctttcatgag ctatgcatcg cagttccttg tcgtctcttc aagctatggg 1620 
cttttcttgg gattatgttt caggtgcctt tggtcttcat cacaaactat ctacaggaaa 1680 
ggtttggctc aacggtgggg aacatgatct tctggttcat cttctgcatt ttcggacaac 1740 

40 cgatgtgtgt gcttctttat taccacgacc tgatgaaccg aaaaggatcg atgtcatgaa 1800 
acaactgttc aaaaaatgac tttcttcaaa catctatggc ctcgttggat ctccgttgat 1860 
gttgtggtgg ttctgatgct aaaacgacaa atagtgttat aaccattgaa gaagaaaaga 1920 
caattagagt tgttgtatcg ca 1942 

<210> 39 
45 <211> 520 
<212> PRT 

<213> Arabidopsis thaliana 


BNSOOCtO: <WO 01 1630BA2J_> 


WO 01/J630S PCTAJS00/23863 

31 


<400> 39 

Met Ala lie Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly 
1 5 10 15 

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 
5 20. 25 30 

Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 
35 40 45 

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg lie Asp 
50 55 60 

10 Ser Val Val Asn Asp Asp Ala Gin Gly Thr Ala Asn Leu Ala Gly Asp 
65 70 75 80 

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 
85 90 95 

Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 
15 100 105 110 

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala He Phe 
115 120 125 

Lys Gin Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu He 
130 135 140 

20 Ala Val Asn Ser Arg Leu He He Glu Asn Leu Met Lys Tyr Gly Trp 
145 150 155 160 

Leu He Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp 
165 170 175 

Pro Leu Phe Met Cys Cys He Ser Leu Ser He Phe Pro Leu Ala Ala 
25 180 185 190 

Phe Thr Val Glu Lys Leu Val Leu Gin Lys Tyr He Ser Glu Pro Val 
195 200 205 

Val He Phe Leu His He He He Thr Met Thr Glu Val Leu Tyr Pro 
210 215 220 

30 Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr 
225 230 235 240 

Leu Met Leu Leu Thr Cys He Val Trp Leu Lys Leu Val Ser Tyr Ala 
245 . 250 255 

His Thr Ser Tyr Asp He Arg Ser Leu Ala . Asn Ala Ala Asp Lys Ala 
35 260 265 270 

Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe 
275* 280 285 
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Thr Ser lie Tyr Glu Phe Leu Phe Val He Phe Tyr Met Tyr Leu Thr 
100 105 lio 

Glu Asn He Leu Lys Leu His Trp Leu Ser Lys He Phe Leu Phe Leu 
115 120 125 

5 His Ser Leu Val Leu Leu Met Lys Met His Ser Phe Ala Phe Tyr Asn 
130 135 140 

Gly Tyr Leu Trp Gly He Lys Glu Glu Leu Gin Phe Ser Lys Ser Ala 
145 150 155 160 

Leu Ala Lys Tyr Lys Asp Ser He Asn Asp Pro Lys Val He Gly Ala 
10 165 170 175 

Leu Glu Lys Ser Cys Glu Phe Cys Ser Phe Glu Leu Ser Ser Gin Ser 
180 185 190 

Leu Ser Asp Gin Thr Gin Lys Phe Pro Asn Asn He Ser Ala Lys Ser 
195 200 205 

15 Phe Phe Trp Phe Thr Met Phe Pro Thr Leu He Tyr Gin He Glu Tyr 
210 215 220 

Pro Arg Thr Lys Glu lie Arg Trp Ser Tyr Val Leu Glu Lys He Cys 
225 230 235 240 

Ala He Phe Gly Thr He Phe Leu Met Met lie Asp Ala Gin He Leu 
20 245 250 255 

Met Tyr Pro Val Ala Met Arg Ala Leu Ala Val Arg Asn Ser Glu Trp 
260 265 270 

Thr Gly He Leu Asp Arg Leu Leu Lys Trp Val Gly Leu Leu Val Asp 
275 280 285 

25 He Val Pro Gly Phe lie Val Met Tyr He Leu Asp Phe Tyr Leu He 
290 295 300 

Trp Asp Ala He Leu Asn Cys Val Ala Glu Leu Thr Arg Phe Gly Asp 
305 310 315 320 

Arg Tyr Phe Tyr Gly Asp Trp Trp Asn Cys Val Ser Trp Ala Asp Phe 
30 325 330 335 

Ser Arg He Trp Asn He Pro Val His Lys Phe Leu Leu Arg His Val 
340 . 345 350 

Tyr His Ser Ser Met Ser Ser Phe Lys Leu Asn Lys Ser Gin Ala Thr 
355 360 365 

35 Leu Met Thr Phe Phe Leu Ser Ser Vai Val His Glu Leu Ala Met Tyr 
370 375 380 

Val He Phe Lys Lys Leu Arg Phe Tyr Leu Phe Phe Phe Gin Met Leu 
385 390 395 400 
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Met Val Ala Pro Thr Leu Cys Tyr Gin Pro Ser Tyr Pro Arg Ser Ala 
290 295 300 

Cys He Arg Lys Gly Trp Val Ala Arg Gin Phe Ala Lys Leu Val He 
305 310 315 320 

' 5 Phe Thr Gly Phe Met Gly Phe He He Glu Gin Tyr lie Asn Pro He 

325 330 335 

Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala He 
340 345 350 

Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys 
10 355 360 365 

Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn He Leu Ala Glu Leu 
370 375 380 

Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys 
385 390 395 400 

15 Ser Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp 

405 410 415 

Met Val Arg His He Tyr Phe Pro Cys Leu Arg Ser Lys He Pro Lys 
420 425 430 

Thr Leu Ala He He lie Ala Phe Leu Val Ser Ala Val Phe His Glu 
20 435 440 445 

Leu Cys He Ala Val Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu 
450 455 460 

Gly He Met Phe Gin Val Pro Leu Val Phe He Thr Asn Tyr Leu Gin 
465 470 475 480 

25 Glu Arg Phe Gly Ser Thr Val Gly Asn Met He Phe Trp Phe He Phe 

485 490 495 

Cys He Phe Gly Gin Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu 
500 505 510 

Met Asn Arg Lys Gly Ser Met Ser 
30 515 520 


<210> 40 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 40 

5 tgcaaattga cgagcacacc aaccccttc 


<210> 41 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence: Synthetic 
oligonuclotide primer 

<400> 41 

aaggatgctt tgagttcctg acaatagg 28 

15 <210> 42 

<211> 1942 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 42 

20 ctctcgtgaa tcctttttcc tttcttcttc ttcttctctt cagagaaaac tttgcttctc 60 
tttctataag gaaccagaca cgaatcccat tcccaccgat ttcttagctt cttccttcaa 120 
tccgctcttt ccctctccat tagattctgt ttcctctttc aatttcttct gcatgcttct 180 
cgattctctc tgacgcctct tttctcccga cgctgtttcg tcaaacgctt ttcgaaatgg 240 
cgattttgga ttctgctggc gttactacgg tgacggagaa cggtggcgga gagttcgtcg 300 

25 atcttgatag gcttcgtcga cggaaatcga gatcggattc ttctaacgga cttcttctct 360 
ctggttccga taataattct ccttcggatg -atgttggagc tcccgccgac gttagggatc 420 
ggattgattc cgttgttaac gatgacgctc agggaacagc caatttggcc ggagataata 4 80 
acggtggtgg cgataataac ggtggtggaa gaggcggcgg agaaggaaga ggaaacgccg 540 
atgctacgtt tacgtatcga ccgtcggttc cagctcatcg gagggcgaga gagagtccac 600 

30 ttagctccga cgcaatcttc aaacagagcc atgccggatt attcaacctc tgtgtagtag 660 
ttcttattgc tgtaaacagt agactcatca tcgaaaatct tatgaagtat ggttggttga 720 
tcagaacgga tttctggttt agttcaagat cgctgcgaga ttggccgctt ttcatgtgtt 780 
gtatatccct ttcgatcttt cctttggctg cctttacggt tgagaaattg gtacttcaga 840 
aatacatatc agaacctgtt gtcatctttc ttcatattat tatcaccatg acagaggttt 900 

35 tgtatccagt ttacgtcacc ctaaggtgtg attctgcttt tttatcaggt gtcactttga 960 
tgctcctcac ttgcattgtg tggctaaagt tggtttctta tgctcatact agctatgaca 1020 
taagatccct agccaatgca gctgataagg ccaatcctga agtctcctac tacgttagct 1080 
tgaagagctt ggcatatttc atggtcgctc ccacattgtg ttatcagcca agttatccac 1140 
gttctgcatg tatacggaag ggttgggtgg ctcgtcaatt tgcaaaactg gtcatattca 1200 

40 ccggattcat gggatttata atagaacaat atataaatcc tattgtcagg aactcaaagc 1260 
atcctttgaa aggcgatctt ctatatgcta ttgaaagagt gttgaagctt tcagttccaa 1320 
atttatatgt gtggctctgc atgttctact gcttcttcca cctttggtta aacatattgg 138Q 
cagagcttct ctgcttcggg gatcgtgaat tctacaaaga ttggtggaat gcaaaaagtg 1440 
tgggagatta ctggagaatg tggaatatgc ctgttcataa atggatggtt cgacatatat 1500 

45 acttcccgtg cttgcgcagc aagataccaa agacactcgc cattatcatt gctttcctag 1560 
tctctgcagt ctttcatgag ctatgcatcg cagttccttg tcgtctcttc aagctatggg 1620 
cttttcttgg gattatgttt caggtgcctt tggtcttcat cacaaactat ctacaggaaa 1680 
ggtttggctc aacggtgggg aacatgatct tctggttcat cttctgcatt ttcggacaac 1740 
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cgatgtgtgt gcttctttat taccacgacc 
acaactgttc aaaaaatgac tttcttcaaa 
gttgtggtgg ttctgatgct aaaacgacaa 
caattagagt tgttgtatcg ca 


tgatgaaccg aaaaggatcg atgtcatgaa 1800 
catctatggc ctcgttggat ctccgttgat 1860 
atagtgttat aaccattgaa gaagaaaaga 1920 

1942 


5 <210> 43 
<211> 234 
<212> DNA 
<213> Glycine max 

<220> 
10 <221> unsure 

<222>- (1) . . (234) 
<223> n=unknown 

<400> 43 

gtaagcttca agagcttagc atanttcctg 
15 ctatcctcgc acaccttata ttcgaaaggg 
atatttacag gagttatggg atttataata 
tcacagcatc ctctcaaggg aaaccttctt 


gttgccccta ncattatgtt accagccaan 60 
ttggctgttt cgccaacttg tcaactgata 120 
gaacaataca ttaatcccat tgtacaaaat 180 
tacgccatcg agagagttct gaag 234 


<210> 44 
<211> 267 
20 <212> DNA 

<213> Glycine max 

<400> 44 

ctgcttttgt atctggtgtc acgttgatgc 
tgtcatatgc acatacaaac tatgatatga 
25 aaacattacc caatactttg atatggagta 
atacttcatg gttgctccta cattatgcta 
tcgaaagggt tgggtgtttc gtcaact 


tattaacttg cattgtgtgg ttaaaattgg 60 
gagcacttac tgtttcgaat gaaaagggag 120 
tccgtacact gtgaccttca ggagtttggc 180 
tcagacaagc tatcctcgca caccttcagt 240 

267 


<210> 45 
<211> 275 
30 <212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (1) . . (275) 
35 <223> n= unknown 


<400> 45 

gtggaatgcc aaaactgttg aagattattg 
gatgatccgc cacctatatt ttccatgttt 
tttaattgcc ttcctggttc tgctttattc 
40 catattcaag tngtgggttt cngnggaatt 
attnntnggc naaaaaattc cnngaacccc 


gaggatgtgg aatatgcctg ttcacaaatg 60 
aaggcacggt ataccaaagg ccgttgctct 120 
catgagctgt gcatcgctgt tccttgccca 180 
nagtttcagg tnccttgggt ttcnaccnna 240 

ggggg 275 
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<210> 46 

<211> 257 

<212> DNA 

<213> Glycine max 


5 <400> 46 

aacggaattg agactccaga gaatatgcca 
ggcttttgga aaaactggca tgcttccttc 
cctcttgggg gatctaagaa aaagctacta 
atctggcatg atttagagtg gaagcttctt 
10 atccctgagt tggtttt 


aaatgtatta ataattgtca caacttggaa 60 
aacaagtggc ttgtgaggta tatatacatt 120 
aatgtgtggg ttgttttcac atttgttgca 180 
tcatgggcat ggttgacgtg tttattcttc 24 0 

257 


<210> 47 
<211> 253 
<212> DNA 
<213> Zea mays 

15 <400> 47 

agaaaatgga acatgcctgt gcataaatgg 
cgaaatggta tatcaaagga agttgctgtt 
atgagttatg tgttgctgtt ccctgccaca 
atgcttcaga ttcccctcat catattgaca 

20 atggttggca at a 


attgttcgtc atatatattt tccttgcatg 60 
tttatatcgt tcttgtttct gctgtacttc 120 
tactcaagtt ctgggctttt tttaggaatc 180 
tcatacctca aaaataaatt cagtgacaca 240 

253 


<210> 48 
<211> 254 
<212> DNA 
<213> Zea mays 


25 <400> 48 

tgaagtatgg cttattaata agatctggct 
ggccactgct aatgtgttgc cttagtctac 
aaaagttggc attcaacaat ctcattagtg 
ttacaacatt tgaaattgta tatccagtgc 

3 0 tatcaggctt tgtg 


tttggtttaa tgctacatca ttgcgagact 60 
ccatatttcc ccttggtgca tttgcagtcg 120 
atcctgctac tacctgtttt cacatccttt 180 
tcgtgattct taagtgtgat tctgcagttt 240 

254 


<210> 49 
<211> 262 
<212> DNA 
<213> Zea mays 


35 <400> 49 

gaagtatggc ttattaataa gatctggctt 
gccactgcta atgtgttgcc ttagtctacc 
aaagttggca ttcaacaatc tcattagtga 
tacaacattt gaaattgtat atccagtgct 

40 acaggctttg tgttgatgtt ta 


ttggtttaat gctacatcat tgcgagactg 60 
catatttccc cttggtgcat ttgcagtcga 120 
tcctgctact acctgttttc acatcctttt 180 
cgtgattctt aagtgtgatt ctgcagtttt 240 

262 


<210> 50 
<211> 325 
<212> DNA 
<213> Zea mays 
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<220> 

<221> unsure 
<222> (1) . . (325) 
<223 > n=unknown 


5 <40O> 50 

taatcnaacc tcgntncngg ttcagctgta 
cacatantca natctnggca tnncngggat 
aagatatctc catgctacgt tcaagcatgt 
cagtatagtc ggacagccga tgtnnnnrma 
10 ggcccaggca agtagatagt ncggcagaga 
nacngagcga gcggcangaa ncagc 


tnccatgaga tatgtaatgc ggtgccgtgc 60 
catngttcag ataccgntgg nattcttgac 120 
aatggtgggc aacatgatan tttggntctn 180 
tctatactac catgacgtca tgaacaggca 240 
catgtacttc aacatcganc atcagnagca 300 

325 


<210> 51 
<211> 519 
<212> DNA 
15 <213> Mortierrella alpina 

<220> 

<221> unsure 
<222> (l) . . (519) 
<223> n=unknovm 


20 <400> 51 

gagnnnngna acgtttagcc tnccgtagcc 
ttanactnaa ttngaaaatn cnnncccaac 
aacggagcac tatatttacc ccgtggtngt 
tggtgctgct cttgataccc ttcatgctta 

25 agtgcatctg caacgccttt gcggaactaa 
attggtggaa ctgcgtcagc tttgatgagt 
acttcttgct ccgccacgtg tacgactcga 
caatgccgcn aattgcaaac gttccttccc 
aagaatgggt ggtgacaacg ttaaagtgcg 


gccaaaatcc aagggncnac cnaccctncg 60 
ttnaggnact tnnagncccc ccnacttgac 120 
tcaacccagc catctcaccc ttgcgagcat 180 
actatctcat gatcttttac atcattttcg 240 
gttgctttgc ggatcgcaac ttttacgagg 300 
gggcacgcaa atggaacaag cctgtgcaac 360 
gcatccgagt ccttccactt gtccgaaatc 420 
ggtcgtcaat gcgttcaacg aacctgggtg 480 
cccggtatc 519 


30 <210> 52 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 52 

ggatccgcgg ccgcacaatg aaaaaaatat cttcacatta ttcgg 45 

<210> 53 
40 <211> 40 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 53 

ggatcccctg caggtcattc attgacggca ttaacattgg 40 


<210> 54 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 54 

ggatccgcgg ccgcacaatg ggagcgaatt cgaaatcagt aacg 44 


15 <210> 55 
<211> 40 
<2i2> DNA 

<213> Artificial Sequence 
<220> 

20 <223> Description pf Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 55 

ggatcccctg caggttaata cccactttta tcaagctccc 40 


<210> 56 
25 <211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
30 oligonucleotide primer 

<400> 56 

ggatccgcgg ccgcacaatg tctctattac tggaagagat c 41 


<210> 57 
<211> 41 
35 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 
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<400> 57 

ggatcccctg caggttatgc atcaacagag acacttacag c 


41 


<210> 58 
<211> 41 
5 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

10 <400> 58 

ggatccgcgg ccgcacaatg ggctggattc cgtgtccgtg c 41 

<210> 59 
<211> 38 
<212> DNA 
15 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 


<210> 60 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
25 <220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 60 

tcgacctgca ggaagcttag aaatggcgat tttggattc 39 

30 <210> 61 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> Description _of Artificial Sequence: 
Oligonucleotide primer 

<400> 61 

ggatccgcgg ccgctcatga catcgatcct tttcgg 35 


20 


<400> 59 

ggatcccctg caggttaacc agaatcaact actttgtg 


38 
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<210> 62 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Description of Artificial Sequence: Annealed 
oligonucleotide adapter 

<400> 62 

cgcgatttaa atggcgcgcc ctgcaggcgg ccgcctgcag ggcgcgccat ttaaat 56 

10 <210> 63 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 63 

tcgaggatcc gcggccgcaa gcttcctgca gg 32 

<210> 64 
20 <211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
25 oligonucleotide 

<400> 64 

tcgacctgca ggaagcttgc ggccgcggat cc 32 

<210> 65 
<211> 32 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

35 <400> 65 

tcgacctgca ggaagcttgc ggccgcggat cc 32 
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<210> 66 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Description of Artificial Sequence: Iiigating 
oligonucleotide 

<400> 66 

tcgaggatcc gcggccgcaa gcttcctgca gg 32 


10 <210> 67 
<211> 36 
<212> DNA 
<213> Artificial 


Sequence 


<220> 

15 <223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 67 

tcgaggatcc gcggccgcaa gcttcctgca ggagct 35 


<210> 68 
20 <211> 28 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: Ligating 
25 oligonucleotide 

<400> 68 

cctgcaggaa gcttgcggcc gcggatcc 28 


<210> 69 
<211> 36 
30 <212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

35 <400> 69 

tcgacctgca ggaagcttgc ggccgcggat ccagct 36 
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<210> 70 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 70 

ggatccgcgg ccgcaagctt cctgcagg 28 

10 <2X0> 71 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 71 

gatcacctgc aggaagcttg cggccgcgga tccaatgca 39 

<210> 72 
20 <211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 


<210> 73 
<211> 2013 
30 <212> DNA 

<213> Arabidopsis thaliana 

<400> 73 

atgcccctta ttcatcggaa aaagccgacg gagaaaccat cgacgccgcc atctgaagag 60 
gtggtgcacg atgaggattc gcaaaagaaa ccacacgaat cttccaaatc ccaccataag 120 

35 aaatcgaacg gaggagggaa gtggtcgtgc atcgattctt gttgttggtt cattgggtgt 180 
gtgtgtgtaa cctggtggtt tcttctcttc ctttacaacg caatgcctgc gagcttccct 240 
cagtatgtaa cggagcgaat cacgggtcct ttgcctgacc cgcccggtgt taagctcaaa 300 
aaagaaggtc ttaaggcgaa acatcctgtt gtcttcattc ctgggattgt caccggtggg 360 
ctcgagcttt gggaaggcaa acaatgcgct gatggtttat ttagaaaacg tttgtggggt 420 

40 ggaacttttg gtgaagtcta caaaaggcct ctatgttggg tggaacacat gtcacttgac 480 
aatgaaactg ggttggatcc agctggtatt agagttcgag ctgtatcagg actcgtggct 540 
gctgactact ttgctcctgg ctactttgtc tgggcagtgc tgattgctaa ccttgcacat 600 
attggatatg aagagaaaaa tatgtacatg gctgcatatg actggcggct ttcgtttcag 660 


25 


oligonucleotide 


<400> 72 

ttggatccgc ggccgcaagc ttcctgcagg t 


31 
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aacacagagg tacgtgatca gactcttagc 
tctaccaacg gtggaaaaaa agcagttata 
ctacatttta tgaagtgggt tgaggcacca 
tggtgtgcaa agtatattaa ggcggtgatg 
5 aaagctgttg cagggctttt ctctgctgaa 
gccccaggat tcttagacac cgatatattt 
atgacacgca catgggactc aacaatgtct 
ggcgggcttg attggtcacc ggagaaaggc 
aacgaaactt gtggtgaagc aggtgaaaac 

10 ggaaggatga tatcttttgg gaaagaagta 
attgattttc gaggtgctgt caaaggtcag 
tggacagagt accatgacat gggaattgct 
tacactgctg gtgaagctat agatctacta 
ggtgccgctc atttctctta tggaattgct 

15 cccaaatact ggtcaaatcc gttagagaca 
tactcattat acggagtggg gataccaacg 
tctcccgaca gttgcatccc ctttcagata 
agctgtctga aagcaggagt ttacaatgtg 
gccgggtaca tgtgtgcaaa agcgtggcgt 

20 aagacttata taagagaata caatcactct 
acgcagagtg gtgcccatgt tgatatcatg 
agggttgccg ccggaggtaa cgggtctgat 
tttgaatggt cggagcgtat tgacctgaag 


cgtatgaaaa gtaatataga gttgatggtt 720 
gttccgcatt ccatgggggt cttgtatttt 780 
gctcctctgg gtggcggggg tgggccagat 840 
aacattggtg gaccatttct tggtgttcca 900 
gcaaaggatg ttgcagttgc cagagcgatt 960 
agacttcaga ccttgcagca tgtaatgaga 1020 
atgttaccga agggaggtga cacgatatgg 1080 
cacacctgtt gtgggaaaaa gcaaaagaac 1140 
ggagtttcca agaaaagtcc tgttaactat 1200 
gcagaggctg cgccatctga gattaataat 1260 
agtatcccaa atcacacctg tcgtgacgtg 1320 
gggatcaaag ctatcgctga gtataaggtc 1380 
cattatgttg ctcctaagat gatggcgcgt 1440 
gatgatttgg atgacaccaa gtatcaagat 1500 
aaattaccga atgctcctga gatggaaatc 1560 
gaacgagcat acgtatacaa gcttaaccag 1620 
ttcacttctg ctcacgagga ggacgaagat 1680 
gatggggatg aaacagtacc cgtcctaagt 1740 
ggcaagacaa gattcaaccc ttccggaatc 1800 
ccgccggcta acctgttgga agggcgcggg 1860 
ggaaactttg ctttgatcga agatatcatg 1920 
ataggacatg accaggtcca ctctggcata 1980 
ctg 2013 


<210> 74 
25 <211> 671 
<212> PRT 

<213> Arabidopeis thaliana 
<400> 74 

Met Pro Leu lie His Arg Lys Lys Pro Thr Glu Lys Pro Ser Thr Pro 
30 1 5 10 15 

Pro Ser Glu Glu Val Val His Asp Glu Asp Ser Gin Lys Lys Pro His 
20 25 30 

Glu Ser Ser Lys Ser His His Lys Lys Ser Asn Gly Gly Gly Lys Trp 
35 40 45 

35 Ser Cys lie Asp Ser Cys Cys Trp Phe lie Gly Cys Val Cys Val Thr 
50 55 60 

Trp Trp Phe Leu Leu Phe Leu Tyr Asn Ala Met Pro Ala Ser Phe Pro 
65 70 75 80 

Gin Tyr Val Thr Glu Arg lie Thr Gly Pro Leu Pro Asp Pro Pro Gly 
40 85 90 95 

Val Lys Leu Lys Lys Glu Gly Leu Lys Ala Lys His Pro Val Val Phe 
100 105 110 

He Pro Gly lie Val Thr Gly Gly Leu Glu Leu Trp Glu Gly Lys Gin 
115 120 125 
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Cys Ala Asp Gly Leu Phe Arg Lys Arg Leu Trp Gly Gly Thr Phe Gly 
130 135 140 

Glu Val Tyr Lys Arg Pro Leu Cys Trp Val Glu His Met Ser Leu Asp 
145 150 155 160 

5 Asn Glu Thr Gly Leu Asp Pro Ala Gly He Arg Val Arg Ala Val Ser 
165 170 175 

Gly Leu Val Ala Ala Asp Tyr Phe Ala Pro Gly Tyr Phe Val Trp Ala 
180 185 190 

Val Leu He Ala Asn Leu Ala His He Gly Tyr Glu Glu Lys Asn Met 
10 195 200 205 

tyr Met Ala Ala Tyr Asp Trp Arg Leu Ser Phe Gin Asn Thr Glu Val 
210 215 220 

Arg Asp Gin Thr Leu Ser Arg Met Lys Ser Asn He Glu Leu Met Val 
225 230 235 240 

15 Ser Thr Asn Gly Gly Lys Lys Ala Val He Val Pro His Ser Met Gly 

245 250 255 

Val Leu Tyr Phe Leu His Phe Met Lys Trp Val Glu Ala Pro Ala Pro 
260 265 270 

Leu Gly Gly Gly Gly Gly Pro Asp Trp Cys Ala Lys Tyr He Lys Ala 
20 275 280 285 

Val Met Asn lie Gly Gly Pro Phe Leu Gly Val Pro Lys Ala Val Ala 
290 295 300 

Gly Leu Phe Ser Ala Glu Ala Lys Asp Val Ala Val Ala Arg Ala He 
305 310 315 320 

25 Ala Pro Gly Phe Leu Asp Thr Asp lie Phe Arg Leu Gin Thr Leu Gin 

325 330 335 

His Val Met Arg Met Thr Arg Thr Trp Asp Ser Thr Met Ser Met Leu 
340 345 350 

Pro Lys Gly Gly Asp Thr lie Trp Gly Gly Leu Asp Trp Ser Pro Glu 
30 355 360 365 

Lys Gly His Thr Cys Cys Gly Lys Lys Gin Lys Asn Asn Glu Thr Cys 
370 375 380 

Gly Glu Ala Gly Glu Asn Gly Val Ser Lys Lys Ser Pro Val Asn Tyr 
385 390 395 400 

35 Gly Arg Met lie Ser Phe Gly Lys Glu Val Ala Glu Ala Ala Pro Ser 

405 410 415 

Glu lie Asn Asn lie Asp Phe Arg Gly Ala Val Lys Gly Gin Ser He 
420 425 430 


SNS0OCID-. <WO 01 16306A2J_: 


WO 01/16308 


44 


PCT/US00/23863 


Pro Asn His Thr Cys Arg Asp Val Trp Thr Glu Tyr His Asp Met Gly 
435 440 445 

He Ala Gly He Lys Ala He Ala Glu Tyr Lys Val Tyr Thr Ala Gly 
450 455 460 

5 Glu Ala He Asp Leu Leu His Tyr Val Ala Pro Lys Met Met Ala Arg 
465 470 475 480 

Gly Ala Ala His Phe Ser Tyr Gly He Ala Asp Asp Leu Asp Asp Thr 
485 490 495 

Lys Tyr Gin Asp Pro Lys Tyr Trp Ser Asn Pro Leu Glu Thr Lys Leu 
10 500 505 510 

Pro Asn Ala Pro Glu Met Glu He Tyr Ser Leu Tyr Gly Val Gly He 
515 520 525 

Pro Thr Glu Arg Ala Tyr Val Tyr Lys Leu Asn Gin Ser Pro Asp Ser 
530 535 540 

15 Cys He Pro Phe Gin He Phe Thr Ser Ala His Glu Glu Asp Glu Asp 
545 550 555 560 

Ser Cys Leu Lys Ala Gly Val Tyr Asn Val Asp Gly Asp Glu Thr Val 
565 570 575 

Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Ala Trp Arg Gly Lys 
20 580 58S 590 

Thr Arg Phe Asn Pro Ser Gly He Lys Thr Tyr He Arg Glu Tyr Asn 
595 600 60S 

His Ser Pro Pro Ala Asn Leu Leu Glu Gly Arg Gly Thr Gin Ser Gly 
^10 615 620 

25 Ala His Val Asp He Met Gly Asn Phe Ala Leu lie Glu Asp He Met 
625 630 635 640 

Arg Val Ala Ala Gly Gly Asn Gly Ser Asp He Gly His Asp Gin Val 
645 650 655 

His Ser Gly He Phe Glu Trp Ser Glu Arg He Asp Leu Lys Leu 
30 660 665 670 


<210:> 75 
<211> 1986 
<212> DNA 

<213> Saccharorayces cerevisiae 
35 <400> 75 

atgggcacac tgtttcgaag aaatgtccag aaccaaaaga gtgattctga tgaaaacaat 60 
aaagggggtt ctgttcataa caagcgagag agcagaaacc acattcatca tcaacaggga 120 
ttaggccata agagaagaag gggtattagt ggcagtgcaa aaagaaatga gcgtggcaaa 180 
gatttcgaca ggaaaagaga cgggaacggt agaaaacgtt ggagagattc cagaagactg 240 
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attttcattc ttggtgcatt cttaggtgta 
gttcataata gcgatagcga cttgtttgac 
tatttggatg attggaaaga tgttctccca 
caggctggta actactccac atcttcttta 
5 aaacaactct tacgtgatta taatatcgag 
gtcatttcta cgggaattga aagctgggga 
gcgcattttc gtaaacggct gtggggaagt 
aaagtttgtt ggttgaaaca tgtaatgtta 
tttacgctac gtgcagcaca gggcttcgaa 

10 atttggaaca aagttttcca aaatctggga 
agtgctgcgt atgattggag gcttgcatat 
acgaagctaa aggaacaaat cgaactgttt 
attggacatt ctatgggttc tcagattatc 
ggccctcttt acggtaatgg tggtcgtggc 

15 aatgcagcag ggacgcttct gggcgctcca 
atgaaagata ccattcaatt aaatacgtta 
agaattgaga gagtaaaaat gttacaaacg 
ggagaagagg tcatttgggg ggatatgaag 
actgacacat acggcaattt cattcgattt 

20 aatttgacaa tgaaagacgc cattaacatg 
agaagagtac atgagcagta ctcgttcggc 
aatgagctac accacaagca ctggtcgaat 
cacatgaaaa tctattgtat atacggggtg 
aaggaagagg atgactcctc tgctctgaat 

25 gtattcctca ccgaggggga cggaaccgtt 
tgggcccagg gtgcttcacc gtacaaccct 
aaacaecagc cagatcgatt tgatatacgt 
atcctcggca gcgcggagtt gaacgattac 
ctcgtcgagc cacgccaatt gtctaatttg 

30 atgtaa 


cttttgccgt ttagctttgg cgcttatcat 300 
aactttgtaa attttgattc. acttaaagtg 360 
caaggtataa gttcgtttat tgatgatatt 420 
gatgatctca gtgaaaattt tgccgttggt 480 
gccaaacatc ctgttgtaat ggttcctggt 54 0 
gttattggag acgatgagtg cgatagttct 600 
ttttacatgc tgagaacaat ggttatggat 660 
gatcctgaaa caggtctgga cccaccgaac 720 
tcaactgatt atttcatcgc agggtattgg 780 
gtaattggct atgaacccaa taaaatgacg 840 
ttagatctag aaagacgcga taggtacttt 900 
catcaattga gtggtgaaaa agtttgttta 960 
ttttacttta tgaaatgggt cgaggctgaa 1020 
tgggttaacg aacacataga ttcattcatt 1080 
aaggcagttc cagctctaat tagtggtgaa 1140 
gccatgtatg gtttggaaaa gttcttctca 1200 
tggggtggta taccatcaat gctaccaaag 1260 
tcatcttcag aggatgcatt gaataacaac 1320 
gaaaggaata cgagcgatgc tttcaacaaa 1380 
acattatcga tatcacctga atggctccaa 1440 
tattccaaga atgaagaaga gttaagaaaa 1500 
ccaatggaag taccacttcc agaagctccc 1560 
aacaacccaa ctgaaagggc atatgtatat 1620 
ttgaccatcg actacgaaag caagcaacct 1680 
ccgctcgtgg cgcattcaat gtgtcacaaa 1740 
gccggaatta acgttactat tgtggaaatg 1800 
ggtggagcaa aaagcgccga acacgtagac 1860 
atcttgaaaa ttgcaagcgg taatggcgat 1920 
agccagtggg tttctcagat gcccttccca 1980 

1986 


<210> 76 
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 
35 <400> 76 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 
1 5 10 15 

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
2Q 25 30 

40 Asn His lie His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 
35 40 45 

He Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 
50 55 60 

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
45 «5 70 75 80 

He Phe He Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
85 90 95 
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Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 no 

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 125 

5 Leu Pro Gin Gly He Ser Ser Phe He Asp Asp He Gin Ala Gly Asn 
130 135 140 

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

Lys Gin Leu Leu Arg Asp Tyr Asn He Glu Ala Lys His Pro Val Val 
10 165 170 175 

Met Val Pro Gly Val He Ser Thr Gly He Glu Ser Trp Gly Val He 
180 185 190 

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 205 

15 Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe He 
20 245 250 255 

Ala Gly Tyr Trp He Trp Asn Lys Val Phe Gin Asn Leu Gly Val He 
260 265 270 

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 

25 Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
250 295 300 

Glu Gin He Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

He Gly His Ser Met Gly Ser Gin He He Phe Tyr Phe Met Lys Trp 
30 3 25 330 335 

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 345 350 

Asn Glu His He Asp Ser Phe He Asn Ala Ala Gly Thr Leu Leu Gly 
355 360 365 

35 Ala Pro Lys Ala Val Pro Ala Leu He Ser Gly Glu Met Lys Asp Thr 
370 375 380 

lie Gin Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
385 390 395 400 
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Arg lie Glu Arg Val Lys Met Leu 
405 

Met Leu Pro Lys Gly Glu Glu Val 
420 

5 Ser Glu Asp Ala Leu Asn Asn Asn 
435 440 

Arg Phe Glu Arg Asn Thr Ser Asp 
450 455 

Lys Asp Ala He Asn Met Thr Leu 
10 465 470 


Gin Thr Trp Gly Gly He Pro Ser 
410 415 

He Trp Gly Asp Met Lys Ser Ser 
425 430 

Thr Asp Thr Tyr Gly Asn Phe He 
445 

Ala Phe Asn Lys Asn Leu Thr Met 
460 

Ser He Ser Pro Glu Trp Leu Gin 
475 480 


Arg Arg Val His Glu Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
485 490 495 

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
500 505 510 

15 Glu Val Pro Leu Pro Glu Ala Pro His Met Lys lie Tyr Cys He Tyr 
515 520 525 

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 
530 535 540 

Asp Ser Ser Ala Leu Asn Leu Thr He Asp Tyr Glu Ser Lys Gin Pro 
20 545 550 555 560 

Val Phe Leu Thr Glu Gly. Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 575 

Met Cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 590 

25 He Asn Val Thr He Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 
595 600 605 

He Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp He Leu Gly Ser 
610 615 620 

Ala Glu Leu Asn Asp Tyr He Leu Lys He Ala Ser Gly Asn Gly Asp 
30 625 630 635 640 

Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 
645 650 655 

Met Pro Phe Pro Met 
660 
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<210> 77 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 77 

ggatccgcgg ccgcacaatg ccccttattc atcgg 35 


10 <210> 78 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 78 

ggatcccctg caggtcacag cttcaggtca atacg 35 

<210> 79 
20 <211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
25 oligonucleotide primer 

<400> 79 

ggatccgcgg ccgcacaatg ggcacactct ttcgaag 37 

<210> 80 
<211> 39 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

35 <400> 80 

ggatcccctg caggttacat tgggcacact gtttcgaag 39 
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